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This Appeal Brief is filed pursuant to the "Notice of Appeal to the Board of Patent 
Appeals and Interferences" mailed June 21, 2002 and received by the Office on June 25, 2002. 



Real Party in Interest 

The real party in interest in this appeal is Millennium Pharmaceuticals, Inc., the assignee 
of the above-referenced patent application. 



Related Appeals and Interferences. 

There are no related appeals and/or interferences involving this application or its subject 

matter. 
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Status of Claims. S 

1— « 

Claims 63-67, 77-79, and 87-104 are the subject of this appeal. The claims appear in § 

o 

Appendix A. Claims 1-62, 68-76, and 80-86 have been cancelled. § 
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Status of Amendments. 

All of Applicant's amendments have been entered. 

Summary of the Invention* 

The pending claims of the present invention are drawn to isolated nucleic acid molecules 
having a nucleotide sequence encoding the 21612 short chain dehydrogenase, nucleotide 
sequences encoding fragments of these polypeptides, and nucleotide sequences encoding 
functional variants of the 21612 short chain dehydrogenase, as well as host cells containing these 
nucleic acid molecules, methods and kits for detecting the 21612 nucleotide sequences, and 
methods of producing the encoded 21612 polypeptides, fragments, and variants. The 21612 
dehydrogenase gives a high score when compared to a consensus sequence for alcohol 
dehydrogenases. 

Issues. 

Issue 1 "Whether the invention of claims 63-67, 77-79, and 87-104 has utility under 35 
U.S.C. §101. 

Issue 2-Whether the invention of claims 63-67, 77-79, and 87-104 is enabled under 35 
U.S.C. § 1 12, first paragraph. 

Issue 3— Whether a sufficient written description of the invention of claims 88-92 is 
provided under 35 U.S.C. § 1 12, first paragraph. 

Issue 4— Whether claim 79 is indefinite under 35 U.S.C. § 1 12, second paragraph. 
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Grouping of Claims. 

The claims do not stand or fall together. While all the pending claims have been rejected 
under 35 U.S.C. § 101 and under 35 U.S.C. § 1 12, first paragraph, on the grounds that the 
claimed invention lacks utility and thus are not enabled, claims 88-92 have also been rejected 
under 35U.S.C.§112, first paragraph, on the grounds that the specification does not provide 
sufficient written description for the claimed subject matter. Furthermore, claim 79 has been 
rejected under 35 U.S.C. § 112, second paragraph, on the grounds that it is indefinite. 
Accordingly, the issues surrounding the claims are different, and the claims do not stand or fall 
together. 

Argument 

Issue l"Whether the invention of claims 63-67, 77-79, and 87-104 has utility under 35 U.S.C. 
§101. 

The Examiner has rejected claims 63-67, 77-79, and 87-104 under 35 U.S.C. §101 as 
lacking utility. The Examiner argues that the claimed invention lacks patentable utility because 
"[t]he specification fails to show a single working example that establishes that the SEQ ID 
N0:8 is a member of Alcohol dehydrogenase (ADH) family, such as by any substantial sequence 
homology and/or functional assay of the protein." March 25, 2002 Office Action, page 3. In 
fact, Applicant has determined that the 21612 protein fimctions as an alcohol dehydrogenase 
using methods that are reliable and are accepted by those of skill in the art. 
Accordingly, the claimed invention has a utility that is specific, substantial, and credible and 
therefore meets the requirements for patentability. 

I. The 21612 polypeptide is an ADH short chain dehydrogenase 

A. Pfam analysis demonstrates that the 21612 polypeptide is an ADH short chain 
dehydrogenase. 

The function of the novel 21612 polypeptide was determined by comparing the 21612 
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amino acid sequence set forth in SEQ ID N0:7 to the Pfam database of protein families. In this 
analysis, the 21612 polypeptide gave a high score when compared to the Pfam consensus 
sequence for ADH short chain dehydrogenases (Pfam Accession No. PF00106). A copy of the 
alignment of the 21612 polypeptide with the Pfam short chain dehydrogenase consensus 
sequence is provided as Appendix B. 

The Pfam database provides a curated collection of well-characterized protein family 
domains with high quality alignments. Functional domains of novel proteins may be identified 
by comparison with the Pfam protein family models. It is well known in the art that the presence 
of a consensus domain characteristic of a family of proteins having a know function may be used 
to determine the function of a novel polypeptide. The sequences included in the Pfam seed 
alignment used to create the short chain dehydrogenase consensus include proteins that have 
been well-characterized biochemically; for example an alcohol dehydrogenase fi'om Drosophila 
(NCBI Accession No, P21898), several human estradiol 17 p -dehydrogenases (NCBI Accession 
Nos. P37059, P51659, and P14061), a human corticosteroid 1 1-p-dehydrogenase (NCBI 
Accession No. P80365), and a human 15-hydroxyprotaglandin dehydrogenase (NCBI Accession 
No. PI 5428). 

The Examiner has stated that the Applicant's arguments regarding the reliability of the 
Pfam analysis are found unpersuasive "because PFAM analysis revealed that 21612 matches with 
a top-scoring domain for ADH-short but with low sequence similarity." March 25, 2002 Office 
Action, page 2, and August 14, 2002 Office Action, page 2. This statement not consistent with 
the understanding of one of skill in the art of Pfam alignments. As known to those of skill in the 
art (and described in the Pfam documentation available at http://pfam.wustl.edu/faq.shtml), Pfam 
alignments do not display homology between pairs of sequences but rather display the fit of a 
particular query sequence to a particular protein family model. Thus the measure of the strength 
of a match between the query sequence and the Pfam consensus alignment is the Pfam bit score, 
which shows the statistical significance of the fit between the query sequence and the Pfam 
consensus alignment. 
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A Pfam "bit score" represents the log base 2 of the ratio of the probability of the 
sequence, given the hypothesis that the sequence belongs to the protein family being modeled 
versus the probability of the sequence given the hypothesis that the sequence was generated 
according to a random background model. Thus, the bit score of 145 for the 21612 polypeptide 
when fit to the Pfam short chain dehydrogenase model means that the 21612 amino acid 
sequence is 2^"^^ (4.46 x 10"^^) times more likely to belong to the short chain dehydrogenase 
family than to contain the amino acid sequence shown in Appendix B by chance. Accordingly, 
the bit score obtained when the 21612 polypeptide is fit to the Pfam ADH short chain 
dehydrogenase model strongly supports the fimctional determination for the 21612 polypeptide 
described in the specification. 

B. The evidence presented by the Examiner does not support the Examiner ^s argument 
that the 21612 polypeptide is not an alcohol dehydrogenase. 

The Examiner argues the Applicant's assertion that the 21612 polypeptide functions as an 
alcohol dehydrogenase is not persuasive because the 21612 polypeptide does not share a high 
level of overall sequence identity with a known alcohol dehydrogenase. In support of this 
arguments, the Examiner states that in a search of the sequence databases to identify sequences 
sharing sequence identity with the 21612 polypeptides, the 21612 polypeptide shared 41.7 % 
sequence identity with a ribotol dehydrogenase from C elegans (NCBI Accession No. T 19954) 
and 12.7% sequence identity with a human alcohol dehydrogenase (NCBI Accession No. 
AA622988). As an initial matter, Applicant notes that the function of the 21612 polypeptide was 
not determined based on overall sequence identity with the proteins, but rather based on Pfam 
analysis as described above. As described above, the, the bit score of 145 for the 21612 
polypeptide when fit to the Pfam short chain dehydrogenase model means that the 21612 amino 
acid sequence is 2^"^^ times more likely to belong to the short chain dehydrogenase family than to 
contain the amino acid sequence shown in Appendix B by chance. 

Furthermore, with regards to NCBI Accession No. AA622988, AppUcant notes that 
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nucleotides 27-386 of this EST sequence (which is only 386 nucleotides in length) share 99% 
sequence identity with nucleotides 1694-2052 of the nucleotide sequence set forth in SEQ ID 
N0:7. Accordingly, the level of sequence identity shared between NCBI Accession No. 
AA62298 and SEQ ED NO: 7 supports rather than undermines the conclusion that the 21612 
polypeptide has dehydrogenase activity. 

In addition, although the polypeptide of NCBI Accession No. T19954 shares 41.7% 
overall sequence with the 21612 amino acid sequence shown in SEQ ID NO: 7, it shares 
approximately 64% local sequence identity and approximately 74% local sequence similarity 
over amino acids 2 -276 of SEQ ID N0:7. This region of the 21612 polypeptide encompasses 
the Pfam short chain dehydrogenase consensus sequence shown in Appendix B. Those of skill in 
the art recognize that a high level of sequence identity within a functional domain of a 
polypeptide is a reliable indicator of polypeptide function. Therefore, the fact that the 21612 
polypeptide and the dehydrogenase of NCBI Accession No. T19954 share 74% sequence 
similarity within a region encompassing the 21612 dehydrogenase domain supports the 
conclusion that the 21612 polypeptide functions as a dehydrogenase. 

The sequence similarity shared between the 21612 polypeptide and NCBI Accession No. 
T 19954 has even greater significance because of the evolutionary divergence between Homo 
sapiens, the source of the 21612 polypeptide, and C. elegans, the source of the polypeptide of 
NCBI Accession No. T19954. Those of skill in the art recognize that lower degrees of sequence 
identity may serve as a an indicator of function when the sequences that are aligned come from 
organisms that are as evolutionarily divergent as C. elegans and Homo sapiens. For example, the 
A9 fatty acid desaturase encoded by the Saccharomyces cerevisiae OLEl gene can be 
functionally replaced by the rat stearoly-CoA desaturase gene despite the fact that the two 
proteins share only 36% overall identity. See, Stukey et al 1990, 1 Biol Chem. 265:20144- 
20149, a copy of which is provided as Appendix C, Similarly, the polypeptides encoded by the 
human PIG-B gene and the S, cerevisiae GPIIO gene both function in glycosyl- 
phosphatidylinositol anchor biosynthesis despite the fact that these polypeptides share only 33 % 
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overall identity. See, Sutterlin et al (1998) Biochem 1 332:153-159, which is provided as 
Appendix D. As another example, the human and S. cerevisiae Cdc7 polypeptides have conmion 
biochemical and functional activities despite the fact that they share only 32% sequence identity. 
See, Sato et al (1997) EMBO J. 16:4340-435 1, which provided as Appendix E. 

C. The function of the 21612 dehydrogenase was determined using methods that are 
accepted by those of skill in the art. 

As described in Applicant's Amendments mailed November 19, 2001 and June 21, 2002, 
the use of sequence similarity, particularly sequence similarity within functional domains of 
proteins, is an art-accepted, reliable tool for predicting protein function. In fact, the art is replete 
with examples of proteins whose functions have been successfully determined based on sequence 
similarity within functional domains. 

One example is seen in Nguyen et al (2001) Mol Pharmacol 59:427-433, provided 
herewith as Appendix F. This reference describes the identification of the H4 receptor based on 
a query of GenBank to identify sequences sharing sequence similarity with GPCRs (see page 
428, column 1, first full paragraph). Based on sequence similarity with the histamine receptor 
H3, the H4 polypeptide was predicted to have histamine receptor function. Subsequently, 
biochemical assays confirmed that histamine serves as a ligand for the H4 receptor and causes 
the receptor*s internalization. 

Another example is described in Dickman (1997) Science 277:1605-1606, a copy of 
which is provided as Appendix G. This reference describes the cloning and characterization of 
p73, a homolog of the tumor suppressor protein p53. The p73 polypeptide was predicted to have 
tumor suppressor activity based on its sequence similarity with p53 in several key domains, 
including the transcription activation domain, DNA binding domain, and oliogmerization domain 
(see page 160, column 3, first full paragraph). In accordance with this, p73 has subsequently 
been shown to have many of the activities of p53, including the ability to initiate cell death using 
the same pathway utilized by p53. Accordingly, sequence similarity to p53 was shown to be an 
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accurate predictor of p73 activity. 

Yet another example is described in Kliewer et al (1998) Cell 92:73-82, a copy of which 
is provided as Appendix H. This reference describes the identification of two novel nuclear 
receptors based on sequence similarity with the ligand binding domains of known nuclear 
receptors (see page 74, column 1, first full paragraph). The novel receptors, termed pregnane X 
receptor 1 and 2 (PXR.l and PXR.2) were predicted to have nuclear receptor activity, Le. 
hormone-regulated transcriptional activity, based on this sequence similarity. Biochemical 
evidence presented in the reference indicates that PXR regulates the transcription of the CYP3A 
family of steroid hydroxylases in a pregnane-dependent manner, confirming the functional 
predictions based on DNA binding domain and ligand binding domain sequence similarity. 

The examples given here, which demonstrate the accuracy of sequence similarity-based 
predictions of protein function, represent only a few of the many such instances that are found in 
the scientific literature. Accordingly, one of skill in the art would find credible the Applicant's 
statement that the 21612 polypeptide functions as a dehydrogenase based on the Pfam analysis 
described above. 

D, A knowledge of the tertiary structure of the 21612 dehydrogenase is not required in 
order to determine the function of this protein. 

The Examiner argues that experimental evidence is required to demonstrate the function 
of the 21612 polypeptide because "it is general knowledge in the art that even conservative 
amino acid substitutions can adversely affect proper folding and biological activity if amino acids 
that art critical for such functions are substituted, and the relationship between the sequence of a 
polypeptide and its tertiary structure is neither well understood nor predictable . . .the ordinary 
artisan would immediately recognize that the encoded polypeptide must assume the proper three- 
dimensional configuration to be active, which is dependent upon the surrounding residues." 
August 14, 2002 Advisory Action, page 2. Applicant notes that the function of the 21612 
dehydrogenase was not determined based on the predicted three-dimensional structure of this 
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polypeptide. Rather, the function of this protein was determined based on the high score that 
resulted when the 21612 sequence was fit with the Pfam model for ADH short chain 
dehydrogenases as described above. Accordingly, the Examiner's arguments are inapplicable to 
the methods used to determine 21612 function. 

Furthermore, while Applicant agrees that certain regions of a protein must maintain a 
particular tertiary structure in order to retain activity, it does not follow that the structure of a 
protein must be known in order to know the function of the protein. In fact, although 
biochemical functions have been determined for thousands of proteins, three-dimensional 
structures have been determined for only a very small percentage of these biochemically- 
characterized proteins. Accordingly, the knowledge of a protein's tertiary structure is not a 
prerequisite to the determination of the protein's function. Therefore, the teaching that the 
prediction of protein tertiary structure based on primary sequence is difficult does not support the 
conclusion that the methods used by the Applicant to determine the function of the 21612 
polypeptide are inaccurate. 

E. Sequence Homology is an Acceptable Basis for Determining Utility According to 
United States Patent and Trademark Office Utility Examination Guidelines 

The United States Patent and Trademark Office "Utility Examination Guidelines, 66 Fed 
Reg. 1092 (2001), make it clear that sequence homology is sufficient to estabhsh utility, and 
that, contrary to statements made by the Examiner, biochemical evidence is not a per se 
requirement for the establishing the utiUty of a novel protein. The "Utility Examination 
Guidelines" state, "[w]hen a patent application claiming a nucleic acid asserts a specific, 
substantial, and credible utility, and bases the assertion upon homology to existing nucleic acids 
or proteins having an accepted utility, the asserted utility must be accepted by the examiner 
unless the Office has sufficient evidence or sound scientific reasoning to rebut such an assertion." 
66 Fed. Reg. at 1096. 
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This statement in the "Utility Examination Guidelines" is illustrated in Example 10 of the 
"Revised Interim Utility Guidelines Training Material Examples, available at 
http://www.uspto.gov/web/offices/pac/utility/utilityguide.pdf This example is directed to a 
polypeptide that shares sequence similarity with a DNA ligase. According to the analysis in the 
Example, "DNA ligases have a well-established use in the molecular biology art based on this 
class of protein's ability to ligate DNA. Revised Interim Utility Guidelines Training Materials, 
March 1, 2000, at 54. As in Example 10, the 21612 polypeptide has been shown to share 
sequence similarity with a protein family having a known biochemical activity. Accordingly, 
based on analogy to Example 10, the present invention also meets the criteria for well-established 
utihty. 

The sequence of Example 10 of the "Training Materials" is accorded to have a specific 
and substantial utility according to the "Revised Interim Utility Guidelines Training Material 
Examples," ibid,^ despite the fact that the encoded polypeptide has not been directly 
demonstrated to have DNA ligase activity, and the substrate (i.e. single-stranded DNA or double- 
stranded DNA; blunt-ended DNA, 5* recessed ended DNA, 3' recessed ended DNA), co-factor 
requirements, and reaction conditions required to practice the invention are not disclosed. Thus, 
in accordance with the Utility Examination Guidelines, the very fact that the sequence of 
Example 10 has sequence similarity with a known protein possessing well-established utility is 
sufficient to confer a specific, substantial, and credible utility upon the claimed sequence. As the 
claimed invention of the present application is analogous to the situation described in Example 

10, the criteria for utility have been met for claims 63-67, 77-79, and 87-104. 

11. A prima facie showing of no utility has not been presented. 

The "Examination Guidelines for the Utility Requirement" (MPEP § 2107) set forth the 

elements required to establish a prima facie case of no utiHty as follows: 

Where the asserted utility is not specific or substantial, a prima facie 
showing must establish that it is more likely than not that a person of ordinary 
skill in the art would not consider that any utility asserted by the applicant would 
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be specific and substantial. The prima facie showing must contain the following 
elements: 

(i) An explanation that clearly sets forth the reasoning used in 
concluding that the asserted utility for the claimed invention is not both specific 
and substantial nor well-estabUshed; 

(ii) Support for factual findings relied upon in reaching this 
conclusion; and 

(iii) An evaluation of all relevant evidence of record, including utilities 
taught in the closest prior art. 

MPEP § 2107 (8th ed. 2001). These guidelines are in accordance with In re Brana, 34 
U.S.P.Q.2d 1437 (Fed. Cir. 1995), where the Federal Circuit held that, "[o]nly after the PTO 
provides evidence showing that one of ordinary skill in the art would reasonably doubt the 
asserted utiUty does the burden shift to the Applicant to provide rebuttal evidence sufficient to 
convince such a person of the invention's asserted utility." 34 U.S.P.Q.2d at 1441. 

In the present case, the Applicant has use methods accepted by those of skill in the art to 
demonstrate that the 21612 polypeptide fiinctions as an alcohol dehydrogenase, a class of 
polypeptides having well-established utility. Nevertheless, the Examiner has maintained the 
rejection under 35 U.S.C. § 101, stating "[t]he specification fails to show a single working 
example that establishes that the SEQ ID NO: 8 which encodes the amino acid sequence of SEQ 
ID N0:7 is a member of Alcohol dehydrogenase (ADH) family." March 25, 2002 Office Action, 
page 3, and August 14, 2002 Advisory Action, page 2. In order to support this rejection in 
accordance with the "Examination Guidelines for the Utility Requirement," the Examiner must 
provide an explanation and factual findings showing that a person of ordinary skill in the art 
would not find Applicant's assertion that the 21612 polypeptide fiinctions as an alcohol 
dehydrogenase to be credible. However, contrary to the requirements of MPEP § 2107, the 
Examiner has not provided factual findings or evidence that support this conclusion. 
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Furthermore, the AppHcant has provided evidence demonstrating that those of skill in the 
art accept the methods used to determine the function of the 21612 polypeptide. Under the 
Guidelines, "[t]he examiner's decision [with respect to patentable utility] must be supported by a 
preponderance of all the evidence of record," MPEP § 2107.02 (8th ed. 2001), citing In re 
Oetiker, 24 U.S.P.Q.2d 1443 (Fed. Cir. 1992). In the present case, the Applicant's references 
support the finding that the 21612 polypeptide functions as an alcohol dehydrogenase, and the 
evidence presented by the Examiner would not lead one of skill in the art to question the 
credibility of this assertion. Accordingly, the preponderance of the evidence supports the 
conclusion that the 2162 polypeptide has patentable utility. 

III. The invention of claims 63-67, 77-79, and 87-104 has utility under 35 U.S.C. §101. 

The Applicant has demonstrated that the 21612 protein is a member of the alcohol 
dehydrogenase family, a protein family having well-established utility. The Applicant has further 
shown that the methods used to determine the function of the 21612 dehydrogenase are rehable 
and well-accepted by those of skill in the art. The Examiner has argued that Applicant's 
functional assignment for the 21612 polypeptide is not credible, but has not provided sufficient 
evidence or reasoning to establish a prima facie case of no utility. For these reasons, the rejection 
of claims 63-67, 77-79, and 87-104 under 35 U.S.C. §101 for lack of patentable utility should be 
reversed. 

Issue 2~Wh ether the invention of claims 63-67, 77-79, and 87-104 is enabled under 35 
U.S.C. § 112, first paragraph. 

I. The Applicant has demonstrated that the 21612 polypeptide functions as an alcohol 
dehydrogenase. 

The Examiner has maintained the rejection of claims 63-67, 77-79, and 87-104 under 35 
U.S.C. § 1 12, first paragraph, on the grounds that the specification does not provide sufficient 
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guidance to enable one skilled in the art to make and use the claimed invention. The Examiner 
argues that the claimed invention is not enabled because "[i]t is unclear how one skill in the art 
would use the invention as claimed when the function of the polypeptide encoded by the 
nucleotide sequence of SEQ ID N0:8 is not known." March 25, 2002 Office Action, page 5. 
However, Applicant has shown that the 21612 polypeptide has dehydrogenase activity as 
described above in the arguments addressing the rejection under 35 U.S.C. § 101. Furthermore, 
the Examiner has presented no evidence to demonstrate that one of skill the art would doubt the 
credibility of Applicant's assertion that the 21612 polypeptide functions as a dehydrogenase. 
Accordingly, the premise on which this grounds for the rejection is based, i.e. that the 21612 
polypeptide does not have dehydrogenase activity, is not supported by the evidence of record. 

II. The speciflcation provides sufficient guidance to allow one of skill in the art to make 
and use functional variants of the 21612 dehydrogenase. 

The Examiner argues that the specification does not provide sufficient guidance to allow 
one of skill in the art to make and use variants of the 21612 polypeptide having dehydrogenase 
activity. In fact, sufficient guidance for making and using the claimed variant sequences is given 
in the specification. Applicant has provided the entire 21612 amino acid sequence, SEQ ID 
N0:7, as well as a nucleotide sequence which encodes it, SEQ ID N0:8 The variant nucleotide 
sequences of claims 88-100 vary fi-om the nucleotide sequence of SEQ ID NO 8 by structural 
parameters (i.e. percent sequence identity to SEQ E) NO: 8, hybridization with the complement 
of SEQ ID NO:8 under stringent conditions, or deletion) that are defined in the specification, and 
the claimed variants retain the dehydrogenase activity of the 21612 polypeptide having the amino 
acid sequence set forth in SEQ ID N0:7. Guidance for determining percent sequence identity 
and hybridization under stringent conditions is provided in the specification. See, page 18, line 4 
et seq. and page 60, line 8 et seq. Polypeptide sequence variants that retain fimction are also 
described in the specification as containing "only conservative variation or variation in non- 
critical residues or in non-critical regions." See, page 21, lines 20-21 of the specification. 
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Guidance regarding conservative substitutions of amino acids is found in the specification on 
page 17, lines 6-12 and in Table 1. 

Furthermore, the 21612 polypeptide shares a high level of sequence identity with a 
consensus domain that is conserved among members of the short chain dehydrogenase family. 
See, Appendix B. The specification also teaches methods for determining additional residues 
that are essential for fiinction, including site-directed mutagenesis and alanine-scanning 
mutagenesis. See, page 22, lines 1-10. 

Finally, the specification provides guidance regarding assays for dehydrogenase activity 
on page 22, lines 11-19. Accordingly, one of skill in the art would be able to determine the 
fiinctionality of 21612 variants. 

Thus, a rational scheme for determining the regions of the 21612 short chain 
dehydrogenase that would tolerate modification is provided. Based on the regions of the 21612 
polypeptide that are conserved with other short chain dehydrogenases, and the methods provided 
for identifying additional residues critical for 21612 function, the skilled artisan could choose 
among possible modifications to produce polypeptides encodes by nucleotide sequences within 
structural parameters set forth in the claims and then test these modified variants to determine if 
they retain dehydrogenase activity. Although some quantity of experimentation would be 
required, the level of experimentation would not be undue in view of the amount of direction 
provided in the specification, the state of the prior art, and the level of skill of one of ordinary 
skill in the art. These factors all favor a conclusion that one of skill in the art could practice the 
claimed invention without undue experimentation. 

Applicant notes that an enabling disclosure need describe the claimed invention in such a 
way as to enable the ordinarily skilled artisan to make and use the invention, and that this 
description be commensurate with the scope of the claimed invention. There is no requirement 
that the disclosure provide working examples of every permutation of the invention. 

Nevertheless, the Examiner has maintained the rejection on the grounds that "the 
specification fails to disclose that any and all variants of SEQ ID N0:7 (as claimed) are capable 
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of eliciting any ADH-like activity." March 25, 2002 Office Action, page 5. However, Applicant 
has not asserted that every nucleotide sequence meeting the structural limitations of claims 88-92 
will also encode a polypeptide having dehydrogenase activity. Rather, these claims are limited to 
nucleotide sequences meeting both the structural requirements of these claims (i.e. the nucleotide 
sequences share sequence identity with the nucleotide sequence set forth in SEQ ID NO: 8, 
hybridize with the nucleotide sequence set forth in SEQ ID NO: 8 imder specified conditions, or 
contain a subset of the nucleotide sequence set forth in SEQ ID N0:8 or the nucleotide sequence 
of the cDNA insert of the plasmid deposited with ATCC as patent deposit number PTA-2170) 
and the functional requirements of these claims (i.e. the nucleotide sequences encode a 
polypeptide having dehydrogenase activity). Furthermore, as described in Applicant's response 
mailed November 19, 2001, the specification provides sufficient guidance to allow one of skill in 
the art to make nucleotide sequences falling within the structural limitations of the claims and 
determine whether these sequences encode polypeptides having the functional limitation of the 
claims. Accordingly, the scope of enablement provided in the specification is commensurate 
with the scope of the claims. 

Applicant cites Gayle et al (1993) J. Biol Chem. 268:22105-221 1 1, provided herewith as 
Appendix I to demonstrate that making functional variants of polypeptides is routine to those of 
skill in the art and does not require undue experimentation. This reference describes saturation 
mutagenesis of the mature human interluekin-la (IL-la) sequence. The authors report that more 
than 3,500 mutants of IL-la were examined to determine their biological activity and their 
binding activity, demonstrating that such mutants can be made and tested for function without 
undue experimentation. Furthermore, the authors state that "[m]ost of the molecule could be 
mutated with little effect on either [biological or binding] activity." Gayle et al at 22109, 
column 2. 

The Examiner has argued that the specification does not enable variants of the 21612 
polypeptide on the grounds that "[i]t is general knowledge in the art that even conservative amino 
acid substitutions can adversely affect proper folding and biological activity if amino acids that 
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are critical for such functions are substituted, and the relationship between the sequence of the 
polypeptide and its tertiary structure is neither well understood nor predictable." March 25, 2002 
Office Action, page 6. The premise of the Examiner's argument appears to be that Applicant 
must demonstrate a working example of each and every sequence falling within the limitation of 
the claims, and that the claimed invention is enabled only if no experimentation is required to 
make and use the claimed variants. This requirement is not supported by the applicable case law. 
The test of enablement is not whether experimentation is necessary, but rather if experimentation 
is necessary, whether it is undue. In reAngstadt, 198 USPQ 214, 219 (C.C.P.A. 1976). Factors 
to be considered in determining whether undue experimentation is required include the quantity 
of experimentation necessary, the amount of guidance provided in the specification, the presence 
of working examples of the invention in the application, the nature of the invention, the state of 
the prior art, the relative skill of those in the art, the predictability in the art, and the breadth of 
the claimed invention. In re Wands, 8 USPQ 2d 1400, 1404 (Fed. Cir. 1988). 

Based on the guidance provide in the specification, the regions of the 21612 polypeptide 
that are conserved with other short chain dehydrogenases, and the methods provided for 
identifying additional residues critical for 21612 function, the skilled artisan could choose among 
possible modifications to produce polypeptides encoded by nucleotide sequences within 
structural parameters set forth in the claims and then test these modified variants to determine if 
they retain dehydrogenase activity. Although some quantity of experimentation would be 
required, the level of experimentation would not be undue in view of the quantity of 
experimentation necessary, the amount of direction provided in the specification, the state of the 
prior art, the presence of a working example of the invention, the level of skill of one of ordinary 
skill in the art, and the breadth of the claimed invention. These factors all favor a conclusion that 
one of skill in the art could practice the claimed invention without imdue experimentation. 
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III. The case law cited by the Examiner does not support the rejection for lack of 
enablement under 35 U.S.C. §, 112, first paragraph. 

In the Advisory Action mailed August 14, 2002, the Examiner cites In re Fisher, 166 
USPQ 19 (CCPA 1970), Hybritech Inc. V, Monoclonal Antibodies, Inc. 231 USPQ 81 (Fed. Cir. 
1986), Genentech Inc. v. Novo Nordisk 42 USPQ2d 1005 (CAFC 1997), In re Wands 8 USPQ2d 
1400 (Fed. Cir. 1988) and Ex parte Singh 17 USPQ2d (BPAI 1991) in support of the argument 
that Applicant must teach each and every sequence that may that may be added, deleted, or 
substituted to the sequences specifically disclosed. However, none of the cited cases support the 
enablement standard applied by the Examiner. 

The facts of Fisher are distinguishable from the present case, because in Fisher the 
applicants had claimed adrenocorticotrophic hormones (ACTHs) of "at least 24 amino acids." 
The Court found that, at the time (the 1960s), the art suggested that all known ACTHs contained 
39 amino acids and that one of skill in the art would not know how to make other ACTHs. In the 
present case, by contrast, guidance regarding the conserved regions of dehydrogenase is both 
known in the art and provided in the specification. Moreover, methods for making altered 
sequences are routine in the art, as are techniques for testing for alcohol dehydrogenase activity. 
Thus, Fisher is inapposite to the present application. 

In Hybritech, the issue before the court was whether the specification of U.S. Patent No. 
4,376,1 10 failed to provide sufficient enablement for the claimed methods of determining 
antibody concentration because the specification did not disclose how to make monoclonal 
antibodies, screen for proper monoclonal antibodies, and measure monoclonal antibody affinity. 
The court held that enablement "is not precluded even if some experimentation is necessary, 
although the amount of experimentation must not be unduly extensive . . .[f]urthermore, a patent 
need not teach, and preferably omits, what is well known in the art." 231 USPQ at 94. The court 
found that, based on the evidence presented at trial, "there was not a shred of evidence that undue 
experimentation was required by those skilled in the art to practice the invention." Id. Similarly, 
in the present application, methods of making and testing variant polypeptides are well known in 
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the art and thus undue experimentation is not required to make and use the claimed 21612 
variants. 

In Wands, the court held that the test of whether an invention requires undue 
experimentation is not based on a single factor, but rather a conclusion reached by weighing 
many factors. The factors to be considered in determining whether undue experimentation is 
required include the quantity of experimentation necessary, the amount of guidance provided in 
the specification, the presence of working examples of the invention in the application, the nature 
of the invention, the state of the prior art, the relative skill of those in the art, the predictability in 
the art, and the breadth of the claimed invention. 8 USPQ2d at 1404. 

Accordingly, the holding of Wands does not require that an applicant identify each and 
every operable embodiment as asserted by the Examiner. Rather, Wands sets out factors to be 
considered in determining whether undue experimentation is required to practice the claimed 
invention. The Applicant has provided the novel polypeptide of SEQ ED NO: 7 and has 
demonstrated that this protein is a member of the ADH short chain dehydrogenase family. Thus, 
contrary to statements made in the office action, a working example of the claimed invention is 
provided. Furthermore, based on the guidance provide in the specification, the regions of the 
21612 polypeptide that are conserved with other ADH family dehydrogenases, and the methods 
provided for identifying additional residues critical for 21612 function, the skilled artisan could 
choose among possible modifications to produce polypeptides encoded by nucleotide sequences 
within structural parameters set forth in the claims and then test these modified variants to 
determine if they retain dehydrogenase activity. Although some quantity of experimentation 
would be required, the level of experimentation would not be undue in view of the quantity of 
experimentation necessary, the amount of direction provided in the specification, the state of the 
prior art (as demonstrated by Gayle et al), the presence of a working example of the invention, 
the level of skill of one of ordinary skill in the art, for whom the making and testing of alcohol 
dehydrogenase variants is routine. These factors all favor a conclusion that one of skill in the art 
could practice the claimed invention without undue experimentation. 
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The Examiner cites Genentech for the court's statement that "[i]t is the specification, not 
the knowledge of one skilled in the art, that must supply the novel aspects of an invention in 
order to constitute enablement." 42 USPQ2d at 1005. However, in the present case the 
specification does provide the novel aspect of the invention, i.e., the 21612 dehydrogenase 
sequences given in SEQ ID N0:7 and SEQ ID N0:8. Once these novel sequences are disclosed, 
it is routine to those of skill in the art to make functional variants as illustrated by the Gayle et al. 
reference cited above. In Genentech, the court held that "where there is no disclosure of any 
specific starting material or of any of the conditions under which a process can be carried out, 
undue experimentation is required." 42 USPQ2d at 1005. In the present case, the specification 
provides both the starting material, i,e, the 21612 sequence, and a rational scheme for making 
functional variants and fi-agments. Accordingly, the holding of Genentech is inapposite to the 
facts of the present case. 

In Singh, the issue before the Board of Patent Appeals and Interferences was whether the 
specification of U.S. Patent AppHcation No. 06/506,098 provided sufficient guidance to enable 
claims to a process of expressing heterologous polypeptides in yeast. The Board found that the 
claims were not sufficient enabled because the complete secretory pathway of yeast was not 
known at the time the invention was made and thus is was not predictable that any protein other 
than the one specifically disclosed in the specification could be successfully expressed in yeast. 
The Examiner cites this case for the court's statement that "[i]t has been recognized that the 
unpredictability of a particular art area may alone provide a reasonable doubt as to the accuracy 
of a broad statement made in support of the enablement of a claim." 17 USPQ2d at 1715. 
However, the Examiner has presented no evidence to demonstrate that making and using 
functional variants of the 21612 polypeptide is sufficiently unpredictable to constitute undue 
experimentation. Furthermore, the Gayle et al, reference provided by Applicant as Appendix I 
demonstrates that making and using functional variants of a known sequence is routine to those 
of skill in the art. Accordingly, the holding of Singh is inapposite to the facts of the present case. 
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IV. The specification provides enablement for the host cells of claims 65-67 and 95- 
97 that is commensurate with the scope of these claims. 

The Examiner has rejected claims 65-67 and 95-97 on the grounds that these claims 
encompass host cells in vivo and therefore Applicant must provide guidance for the use of the 
sequences in gene therapy and in producing transgenic animals. The Examiner argues that "gene 
therapy is considered a highly experimental area of research at this time, and both researchers and 
the public agree that demonstratable progress to date has fallen short of initial expectations." 
March 25, 2002 Office Action, page 7. The Examiner also argues that "the phenotype of an 
animal is determined by a complex interaction of genetics and environment." March 25, 2002, 
Office action, page 7. However, the arguments presented in the Office Action are not relevant to 
the claimed subject matter because the claims are not directed to methods of gene therapy or to 
methods of producing a transgenic animal having a particular phenotype but instead are directed 
to host cells containing specified nucleic acid molecules. Methods of producing such host cells 
are described on pages 85-90 of the specification. Based on the guidance provided, one of skill 
in the art could produce the claimed host cells without undue experimentation. The proper test 
for the enablement of an invention is whether the specification provides enablement 
commensurate with the scope of what is claimed. The specification has provided sufficient 
guidance to allow one of skill in the art to make and use the host cells of claims 65-67 and 95-97, 
and therefore the enablement requirement is met. 

V. The invention of claims 63-67, 77-79, and 87-104 is enabled under 35 U.S.C. § 112, 
first paragraph. 

In Genentech v. Novo Nordisk, 42 USPQ 1001 (Fed. Cir. 1997) the Federal Circuit held 
that "[t]o be enabling, the specification of a patent must teach those skilled in the art how to 
make and use the fiiU scope of the claimed invention without *undue experimentation/ " 42 
USPQ at 1004, citing In re Wright, 999 F.2d 1557, 1561, 27 USPQ2d 1510, 1513 
(Fed.Cir.1993); Amgen, Inc. v. Chugai Pharm. Co., 927 F.2d 1200, 1212, 18 USPQ2d 1016, 
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1026 (Fed.Cir.1991); In re Fisher, 57 C.C.P.A. 1099, 427 F.2d 833, 839, 166 USPQ 18, 24 
(1970), emphasis added. Applicant has provided the sequence of a novel dehydrogenase, 
provided a rational scheme for making functional variants and fragments of this dehydrogenase, 
and demonstrated that making such variants and fragments is routine to those of skill in the art. 
Accordingly, the requirements for enablement have been met, and the rejection of claims 63-67, 
77-79, and 87-104 under 35 U.S.C. §112, first paragraph, for lack of enablement should be 
reversed. 

Issue 3— Whether a sufficient written description of the invention of claims 88-92 is provided 
under 35 U.S.C. § 11 2, first paragraph. 

I. The Applicant has demonstrated that the 21612 polypeptide functions as an alcohol 
dehydrogenase. 

The Examiner has maintained the rejection of claims 88-92 under 35 U.S.C. § 1 12, first 
paragraph, on the grounds that the specification does not provide an adequate written description 
for the claimed invention because "the applicant fails to point out where in the specification it is 
disclosed that the polypeptide encoded by the nucleic acid molecule of SEQ ID NO: 8 have any 
alcohol-dehydrogenase-like activity explicitly or implicitly as putatively consider by the 
applicant." March 25, 2002 Office Action, page 8. However, Applicant has shovra that the 
21612 polypeptide has dehydrogenase activity as described in the specification and addressed in 
the arguments above relating to the rejection under 35 U.S.C. § 101. Furthermore, the Examiner 
has presented no evidence to demonstrate that one of skill the art would doubt the credibility of 
Applicant's assertion that the 21612 polypeptide functions as a dehydrogenase. Accordingly, the 
premise on which the rejection is based, i.e. that the 21612 polypeptide does not have 
dehydrogenase activity, is not supported by the evidence of record. 
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IL An improper standard has been applied in determining whether the specification 
provides a sufficient written description of the invention of claims 88-92. 

The Examiner states that the rejection under 35 U.S.C. § 1 12, first paragraph for lack of 
written description in maintained because "[t]he specification fails to disclose any and all 
variants of nucleic and amino acid sequences of SEQ ID NO(s) as claimed." March 25, 2002 
Office Action, page 8. This statement suggests that the Applicant must disclose the sequence of 
each variant falling within the structural and functional limitations set forth in the claims in order 
to adequately describe the claimed genus of sequences. However, the requirement set forth in the 
office action is not supported by the "Guidelines for Examination of Patent Applications Under 
the 35 U.S.C. §1 12, ^ 1, 'Written Description' Requirement'* (66 Fed. Reg. 1099 (2001)) or the 
supporting case law. 

The "Guidelines for Examination of Patent Applications Under the 35 U.S.C. 1 12, ^ 1, 
'Written Description' Requirement" state that genus may be described by "sufficient description 
of a representative number of species ... or by disclosure of relevant, identifying characteristics, 
Le, structure or other physical and/or chemical properties." Id, at 1 106. This requirement is in 
accordance with Regents of the University of California v. Eli Lilly c& Co., 1 19 F.3d 1559 (Fed. 
Cir. 1997), where the court held that "[a] written description of an invention involving a 
chemical genus, like a description of a chemical species, 'requires a precise definition, such as by 
structure, formula, or chemical name' of the claimed subject matter sufficient to distinguish it 
from other materials." 119 F.3d at 1568, citing Fiers v l^evel 984 F.2d 1 164 (Fed. Cir. 1993). 

Applicant submits that the written description provided for the sequences recited in 
claims 88-92 meets this requirement. These claims recite the identifying structural 
characteristics that define each genus of nucleotide sequences. Claims 88-90 recite nucleotide 
sequences having at least 70%, 80%, or 90% sequence identity with the nucleotide sequence set 
forth in SEQ ID N0:8, claim 91 recites nucleotide sequences that hybridize to the nucleotide 
sequence set forth in SEQ ID NO: 8 under specified conditions, and claim 92 recites nucleotide 
sequences encoding a fragment of the amino acid sequence set forth in SEQ ID NO: 7 or the 
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amino acids sequence encoded by the cDNA insert of the plasmid deposited with ATCC as 
Patent Deposit Number PTA-2170, wherein the fragment has dehydrogenase activity and consists 
of at least 139 contiguous amino acids of the amino acid sequence set forth in SEQ ID N0:7 or 
the amino acid sequence encoded by the cDNA insert of the plasmid deposited with ATCC as 
Patent Deposit Number PTA-2170. These structural limitations are sufficient to distinguish the 
claimed nucleotide sequences from other materials and thus sufficiently define the claimed 
genus. 

Furthermore, in Regents of the University of California v. Eli Lilly & Co, 119 F.3d 1559 
(Fed. Cir. 1997), the court held that "[a] description of a genus of cDNAs may be achieved by 
means of a recitation of a representative number of cDNAs, defined by nucleotide sequence, 
falling within the scope of the genus or of a recitation of structural features common to the 
members of the genus, which features constitute a substantial portion of the genus." 119 F.3d at 
1569. The recitation of the structural features of sequence identity with SEQ ID N0:7, 
hybridization with SEQ ID N0:7, or the presence of subsequences of SEQ ID N0:8 or the amino 
acid sequence encoded by the plasmid deposited with ATCC as Patent Deposit Number PTA- 
2170, where the fragments have a given minimum length, is sufficient to satisfy this requirement. 

Applicant has further provided the functional characteristics that distinguish the claimed 
sequences of the genus. Specifically, claims 88-92 recite that the variants and fragments have 
dehydrogenase activity. Accordingly, both the structural properties and the functional properties 
that characterize the claimed genus are specifically recited in the claims. 

The present claims are analogous to those presented in Example 14 of the "Revised 
Interim Written Description Guidelines Training Materials," available at 
www.uspto.gov/web/menu/written.pdf Example 14 is directed to a protein having at least 95% 
sequence identity to the sequence of SEQ E) NO: 3, wherein the sequence catalyzes the reaction 
A B. The conclusion in the Training Materials is that the generic claim of Example 14 is 
sufficiently described under § 1 12, first paragraph, because 1) "the single sequence disclosed in 
SEQ ID NO: 3 is representative of the genus" and 2) the claim recites a limitation requiring the 
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compound to catalyze the reaction from A — > B. Therefore one of skill in art would recognize 
that the Applicant was in possession of the necessary common attributes possessed by the 
members of the genus. 

Following the analysis of Example 14, AppUcant submits that claims 88-92 satisfy the 
written description requirements of § 112, first paragraph. Specifically, the claims of the present 
invention encompass nucleotide sequences having sequence identity to the nucleotide sequence 
of SEQ ID NO: 8, hybridizing under stringent conditions to the nucleotide sequence of SEQ ID 
N0:8, comprising a subsequence of SEQ ID N0:7, wherein the claimed sequences encode a 
polypeptide having a specified activity. As in Example 14, the specification discloses the nucleic 
acid sequence of SEQ ID N0:1, and the claims recite a limitation requiring the compound to 
have a specific fiinction (i.e. dehydrogenase activity). Accordingly, claims 88-92 provide the 
relevant, identifying characteristics that describe the claimed genus, and one of skill in the art 
would recognize that the inventors were in possession of the claimed invention. 

III. The case law cited by the Examiner does not support the rejection for insufficient 
written description under 35 U.S.C. §, 112, first paragraph. 

The Examiner cites In re Shokal 113 USPQ 283 (CCPA 1957), Purdue Pharma LP. v. 
Faulding Inc.56 USPQ2d 1481 (Fed. Cir. 2000), and Fiddes v. Baird, 30 USPQ2d 1481 (BPAI 
1993) in support of the rejection for insufficient written description. However, none of the cited 
cases support the standard for written description applied by the Examiner. 

The Examiner cites Shokal in support of the argument that a single species can rarely be 
used to support a generic claim. However, the facts of the present case are readily 
distinguishable fi-om those in Shokal. In Shokal, the specification filed by the applicants did not 
define the claimed genus by identifying characteristics that would distinguish the claimed 
invention from other compounds. Instead, the applicants attempted to establish possession of the 
claimed genus by the disclosure of representative species. 
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In contrast, the claimed genus in the present application is defined in the specification by 
the functional properties (ue, dehydrogenase activity) and the structural properties (Le. sequence 
identity with disclosed sequences, hybridization with disclosed sequences, or deletions of 
disclosed sequences) shared by the encompassed species. This description is sufficient to 
distinguish the claimed genus firom other materials. Accordingly, the disclosure of many species 
within the claimed genus is not required. Indeed, the court in Shokal makes this distinction, 
stating, ''where the genus is not set forth in express terms, the number and nature of the examples 
given, together with the accompanying disclosure must be such as to indicate clearly what the 
genus actually is." 113 USPQ at 285, emphasis added. 

The Examiner also cites Purdue Pharma in support of the argument that a single species 
is rarely sufficient to describe a broad genus. The issue before the court in Purdue Pharma was 
whether the specification of U.S. Patent No. 5,672,360 provided a sufficient written description 
for a claim to a method for treating pain using an opioid at concentration such that the maximum 
plasma concentration (Cmax) was more than twice the plasma level of the opioid 24 hours after 
administration (C24). The court found that "there is nothing in the written description . . . that 
would suggest to one skilled in the art that the Cmax/C24 ratio is an important defining quality of 
the formulation." 56 USPQ2d at 1486. The court fiirther stated that "[w]hat the '350 patentees 
have done is to pick a characteristic possessed by two of their formulations, a characteristic that 
is not discussed even in passing in the disclosure, and then make it the basis of claims that cover 
not just those two formulations, but any formulation that has that characteristic." USPQ2d at 
1487. In contrast, the structural and fimctional limitations recited in claims 88-92 are specifically 
described in the specification. Accordingly, the present case is factually distinguishable fi"om 
Purdue Pharma, and this case does not support the rejection of claims 88-92 for insufficient 
written description. 

In Fiddes, the issue before the Board of Patent Appeals and Interferences was whether a 
count directed to a recombinant DNA molecule encoding any mammalian basic fibroblast growth 
factor (FGF) was sufficiently described by the disclosure of the amino acid sequence of bovine 
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FGF. The Board found that the claimed subject matter was not sufficiently described because the 
specification had described no naturally-occurring genes encoding FGF and thus had not 
described even one species falUng within the claimed genus of DNA molecules. The Board 
stated that the inventor "was not in possession of the naturally occurring gene encoding bovine 
pituitary FGF . . . thus Baird was not in possession of the genes encoding mammalian FGFs since 
in 1987, knowledge of the amino acid sequence of a protein coupled with the estabhshed 
relationship between a nucleic acid and the protein it encodes would not establish possession of 
the gene encoding that protein." 30 USPQ2d at 1483. In contrast, the present application does 
provide the naturally-occurring human nucleotide sequence encoding the 21612 dehydrogenase. 
In addition, claims 88-92, unlike the count that was at issue in Fiddes, specifically recite the 
structural features that characterize the claimed genus of nucleotide sequences. As described 
above, these structural features are sufficient to distinguish the claimed nucleotide sequences 
from other materials and therefore satisfy the requirements of 35 U.S.C. § 1 12, first paragraph. 

IV. The invention of claims 88-92 is sufficiently described under 35 U.S.C. § 112, first 
paragraph. 

In Regents of the University of California v. Eli Lilly &Co.,\ 19 F.3d 1559 (Fed. Cir. 
1997), the court held that "[a] written description of an invention involving a chemical genus, 
like a description of a chemical species, 'requires a precise definition, such as by structure, 
formula, or chemical name' of the claimed subject matter sufficient to distinguish it from other 
materials." 119 F.3d at 1568, citing Fiers v T^pvrI 984 F.2d 1 164 (Fed. Cir. 1993). Claims 88- 
92 recite both the structural features and the functional properties that characterize the claimed 
genus of nucleotide sequences and distinguish them from other materials. Accordingly, claims 
88-92 provide the relevant, identifying characteristics that describe the claimed genus, and one of 
skill in the art would recognize that the inventors were in possession of the claimed invention. 
For these reasons, the rejection of claims 88-92 under 35 U.S.C. §1 12, first paragraph, for 
insufficient written description should be reversed. 
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Issue 4— Whether claim 79 is indefinite under 35 U.S.C. § 112, second paragraph. 

The Examiner has maintained the rejection of claim 79 under 35 U.S.C. § 1 12, second 
paragraph on the grounds that the claim is indefinite because "it is unclear what the 'instructions 
for use' would be in this context." August 14, 2002 Office Action, page 5. Claim 79 recites, "[a] 
kit for use in the method of claim 77, wherein said kit comprises at least one nucleic acid probe 
of claim 77 and instructions for use in the method of claim 77." Claim 77 recites, "[a] method 
for detecting the presence of a nucleic acid molecule of claim 87 in a sample, said method 
comprising the steps of contacting the sample with a nucleic acid probe which selectively 
hybridizes to the nucleic acid molecule and determining whether the nucleic acid probe binds to 
the nucleic acid molecule in the sample . . Thus, claim 77 specifically recites the essential steps 
of the claim method of detection, and one of skill in the art would recognize what is intended by 
the phrase "instructions for use in the method of claim 77." For these reasons, the rejection of 
claim 79 under 35 U.S.C. §1 12, second paragraph, should be reversed. 
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CONCLUSION 

In view of the arguments presented above, Applicant contends that each of claims 63-67, 
77-79, and 87-104 is patentable. Therefore, reversal of the rejections under 35 U.S.C. §101 and 
35 U.S.C. §112, first and second paragraphs, is respectfully solicited. 
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APPEALED CLAIMS 



63. The nucleic acid molecule of claim 87 further comprising vector nucleic 
acid sequences. 

64. The nucleic acid molecule of claim 87 further comprising nucleic acid 
sequences encoding a heterologous polypeptide. 

65. A host cell which contains the nucleic acid molecule of claim 87. 



66. The host cell of claim 65 which is a mammalian host cell. 



67. A nonhuman mammalian host cell containing the nucleic acid molecule of 
claim 87. 



77. A method for detecting the presence of a nucleic acid molecule of claim 
87 in a sample, said method comprising the steps of contacting the sample with a nucleic 
acid probe which selectively hybridizes to the nucleic acid molecule and determining 
whether the nucleic acid probe binds to the nucleic acid molecule in the sample; wherein 
said nucleic acid probe is selected from the group consisting of: 

a) the nucleotide sequence set forth in SEQ ID N0:8; 
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b) the nucleotide sequence of a fragment of the nucleotide sequence 
set for in SEQ ID N0:8, wherein said fragment comprises at least 417 contiguous 
nucleotides of the nucleotide sequence set forth in SEQ ID N0:8; 

c) a nucleotide sequence having at least 70% sequence identity to the 
nucleotide sequence set forth in SEQ ED NO: 8; and 

d) a nucleotide sequence complementary to a nucleotide sequence of 
a), b), or c). 

78. The method of claim 77, wherein the sample comprises mRNA molecules. 

79. A kit for use in the method of claim 77, wherein said kit comprises at least 
one nucleic acid probe of claim 77 and instructions for use in the method of claim 77. 

87. An isolated nucleic acid molecule comprising a nucleotide sequence 
selected from the group consisting of: 

a) the nucleotide sequence set forth in SEQ ID NO: 8; 

b) the nucleotide sequence of the cDNA insert of the plasmid 
deposited with ATCC as Patent Deposit Number PTA-2170; 

c) a nucleotide sequence encoding the amino acid sequence set forth 
in SEQIDN0:7; 

d) a nucleotide sequence encoding the amino acid sequence encoded 
by the cDNA insert of the plasmid deposited with ATCC as Patent Deposit Number PTA- 
2170; and 

e) a nucleotide sequence complementary to a nucleotide sequence of 

a), b), c), or d). 

88. An isolated nucleic acid molecule having a nucleotide selected from the 
group consisting of: 

a) a nucleotide sequence encoding a polypeptide having 
dehydrogenase activity, wherein said nucleotide sequence has at least 70% sequence 
identity with the nucleotide sequence set forth in SEQ ID NO: 8; and 
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b) a nucleotide sequence complementary to the nucleotide sequence 

of a). 

89. The nucleic acid molecule of claim 88, wherein said nucleotide sequence 
is selected from the group consisting of: 

a) a nucleotide sequence encoding a polypeptide having 
dehydrogenase activity, wherein said nucleotide sequence has at least 80% sequence 
identity with the nucleotide sequence set forth in SEQ ID N0:8; and 

b) a nucleotide sequence complementary to the nucleotide sequence 

of a). 

90. The nucleic acid molecule of claim 89, wherein said nucleotide sequence 
is selected from the group consisting of: 

a) a nucleotide sequence encoding a polypeptide having 
dehydrogenase activity, wherein said nucleotide sequence has at least 90% sequence 
identity with the nucleotide sequence set forth in SEQ ID N0:8; and 

b) a nucleotide sequence complementary to the nucleotide sequence 

of a). 

91. An isolated nucleic acid molecule comprising a nucleotide sequence 
selected from the group consisting of: 

a) a nucleotide sequence encoding a polypeptide having 
dehydrogenase activity, wherein the complement of said nucleotide sequence hybridizes 
under stringent conditions to the nucleotide sequence set forth in SEQ ID N0:8, said 
stringent conditions comprising hybridization at about 45°C, followed by at least one 
wash in 0.2X SSC/0.1% SDS at 65°C; and 

b) a nucleotide sequence complementary to the nucleotide sequence 

of a). 

92. An isolated nucleic acid molecule comprising a nucleotide sequence 
selected from the group consisting of 
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a) a nucleotide sequence encoding a fragment of the amino acid 
sequence set forth in SEQ ID N0:7, wherein said fragment has dehydrogenase activity 
and consists of at least 139 contiguous amino acids of SEQ ID N0:7; and 

b) a nucleotide sequence encoding a fragment of the amino acid 
sequence encoded by the cDNA insert of the plasmid deposited with ATCC as Patent 
Deposit Number PTA-2170, wherein the fragment has dehydrogenase activity and 
consists of at least 139 contiguous amino acids of the amino acid sequence encoded by 
the cDNA insert of the plasmid deposited with ATCC as Patent Deposit Number PTA- 
2170; and 

c) a nucleotide sequence complementary to the nucleotide sequence 

of a) or b). 

93. The nucleic acid molecule of claim 88 fixrther comprising vector nucleic 
acid sequences. 

94. The nucleic acid molecule of claim 88 fiirther comprising nucleic acid 
sequences encoding a heterologous polypeptide. 

95. A host cell which contains the nucleic acid molecule of claim 88. 

96 The host cell of claim 95, wherein said host cell is a mammaUan host cell. 

97. A nonhuman mammaUan host cell containing the nucleic acid molecule of 
claim 88. 

98. A method for detecting the presence of a nucleic acid molecule of claim 
88 in a sample, said method comprising the steps of contacting the sample with a nucleic 
acid probe which selectively hybridizes to the nucleic acid molecule and determining 
whether the nucleic acid probe or primer binds to a nucleic acid molecule in the sample; 
wherein said nucleic acid probe is selected from the group consisting of: 

a) the nucleotide sequence set forth in SEQ ID N0:8; 
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b) the nucleotide sequence of a fragment of the nucleotide sequence 
set for in SEQ ID NO: 8, wherein said fragment comprises at least 417 contiguous 
nucleotides of the nucleotide sequence set forth in SEQ JD N0:8; 

c) a nucleotide sequence having at least 70% sequence identity to the 
nucleotide sequence set forth in SEQ ID N0:8; and 

d) a nucleotide sequence complementary to a nucleotide sequence of 
a), b), or c). 

99. The method of claim 98, wherein the sample comprises mRNA molecules. 

100. A kit for use in the method of claim 98, wherein said kit comprises at least 
one nucleic acid probe of claim 98 and instructions for use in the method of claim 98. 

101 . A method for producing a polypeptide, said method comprising culturing 
a host cell containing a nucleic acid molecule of claim 87 under conditions in which the 
polypeptide encoded by the nucleic acid molecule is expressed. 

1 02. A method for producing a polypeptide, said method comprising culturing 
a host cell containing a nucleic acid molecule of claim 88 imder conditions in which the 
polypeptide encoded by the nucleic acid molecule is expressed. 

103. A method for producing a polypeptide, said method comprising culturing 
a host cell containing a nucleic acid molecule of claim 90 under conditions in which the 
polypeptide encoded by the nucleic acid molecule is expressed. 

104. A method for producing a polypeptide, said method comprising culturing 
a host cell containing a nucleic acid molecule of claim 91 under conditions in which the 
polypeptide encoded by the nucleic acid molecule is expressed. 
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In re: Meyers 

Appl. No. 09/464,039 

Filed December 15, 1999 



Protein Family / Domain Matches, HMMer version 2 

Searching for complete domains ^^..^k^^^ 
hmmpfam - search a single seq against HMM database 

fright' (cJ'llga^lIU Washington University School of Medicine 

hSS^R is freely distributed under the GNU General PubUc Lxcense (GPL) . 

^ fil' ~ /prod/ddm/seqanal/PFAM/pfam4.3/Pfain 
^ence'file: /prod/ddm/wspace/orfanal/oa-script. 6378 .seq 

Query: 21612 

scores for sequonce family classification (score includes^all dentins) :^ 
Model Description 



'short'chain dehydrogenase 1^5.0 1.3e-39 1 
|; ^fa-;iactair>ase Beta-lactamase 

Parsed for domains: cf-nr-*- E-value 

Model Domain seq-f seq-t hmm-f hmm-t score E_v^^^^ 

"i/l 11 '204 1 203 tl 145.0 1.3e-39 

adh_short 1/1 -mi t -1 g g 

beta-lactamase 1/1 222 236 317 331 -1 3-3 

Alignments of top-scoring domains: i>,c n p - i -1p-39 



++ +TGas+GIG+ai'A+ ak+Ga++v+a+ ++ + )t+ 
11 cTVFITGASRGIGKAIALKAAKDGANIVIAAKTAQPHPKLlgtiyTA 57 

akelkelGgndkdralaiqlDvtdeesv.aaveqaverlGrlDvLVNHAG 
a+e+++ GO +al++ +Dv+de++++aave+a++++G++D+LVNNA 
58 AEEIEAVGG---KALPCIVDVRDEQQISAAVEKAIKKFGGIDILVNNAS 103 

giillrpgpfaelsrtmeedwdrvidvNltgvflltravlplmamkkrgg 
+++++++ ♦ d +++VN +g+ 1 ++a++p +kk++ 
104 ---AISLTrTrUn'P---TKRLDU4MN\^RGTYLASKACIP--YLKKSKV 145 

GrIvNiSSvaGrkeg -glvgvpgg . - saYsASKaAvigltrsLAlElaph 
J Mis + 1++V++ +++aY+ +K ++ + ++A E + 

21612 146 AHILNISP plNLNPVWFKqhCAYTIAKYGMSMYVLGMAEEFKGE 189 

gl rVna vaP . GgvdTd< - * 
I Vna P+ ++ T+ 
21612 190 -lAVNALWPkTAIHTA 204 

f 1. from 222 to 236: score 3.3, E = 6.6 



21612 



21612 



21612 



beta-lactamase: domain 1 o 

*->eliaeaakwleaLg<- * 

++ia+aa++++++ ♦ 
21612 222 DIIADAAYSIFQKPK 236 
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The OLEl Gene of Saccharomyces cerevisiae Encodes the d9 Fatty 
Acid Desaturase and Can Be Functionally Replaced by the Rat 
Stearoyl-CoA Desaturase Gene* 

^ (Received for publication, June 12, 1990) 

Joseph E, Stukeyt, Virginia M. McDonough§t and Charles E. Martini 

From the Nelson Biological Laboratory, Bureau of Biological Research. Futgers Universify^ Busch Campus, 
PicsQ tawoy, New Jersey 08855- 1 059 j • 



Strains of Saccharomyt^ cerevisiae bearing the 
del mutation are defective in unsaturated fatty acid 
(UFA) synthesis and require UFAfi for growth. A pre- 
viously isolated yeast genomic fragment complement- 
ing the olel mutation has been sequenced and deter- 
mined to encode the A9 fatty acid desaturase enzyme 
by comparison of primary amino acid sequence to the 
rat liver stearoyl^CoA desaturase. The OLEl struct 
tural gene encodes a protein of 510 amino acids (251 
hydrophobic) having an approximate molecular mass 
of 67.4 kDa, A 257-amino acid internal region of the 
yeast open reading frame aligns with and shows 86% 
identity and 60% similarity to the rat liver stearoyl- 
CoA desaturase protein- This comparison disclosed 
three short regions of high consecutive amino aoid 
identity (>70%) including one 11 of 12 perfect residue 
match. The predicted yeast enzyme contains at least 
four potential membrane-spanning regions and several 
shorter hydrophobic regions that align exactly with' 
simjlar sequences in the rat liver protein. An a2eJ gene- 
disrupted yeast strain was transformed with a yeast- 
rat chimeric gene consisting of the promoter region 
and N-terminal 27 codons of OLBl fused to the rat 
desaturase coding sequence. Fusion gene transform- 
ants displayed near equivalent growth rates and mod- 
est lipid composition changes relative to wild type 
yeast control implying a significant conservation of A9 
desaturase tertiary structure and efficient interaction 
between the rat desaturase and yeast cytochrome ba. 



In animal and fungal cells, monounaaturated fatty acids are 
synthesized via an aerobic process from saturated fatty acid 
precursors by a microsomal membrane-bound thres-compo- 
nent enzyme system involving cytochrome bt, NADH-depend- 
ont cytochrome 6b reductase, and the fatty acid desaturase 



This work was supported by National Science Foundation Grant 
DMB84-17802. Biomedical Reaearch Group Grant RR-07058-2X from 
the Public Health Service, and by a ^ant from the Bureau of 
RiuIuKicui KuKfarch Churles and Johanns Busch Memorial Fund. The 
coflts of publication of thia article were defrayed in part by the 
payment of pape chflrgeB, This article must therefore be hereby 
marked "adveriUement" in accordance with IS U.S.C. Section 1734 
Bolely to indicate thi« fact. 

The nucleotide hequcnce(e} reported in this paper has been submittisd 
to the GenHanh'^/EMHL Data Bank tvith aczcitaion numb<tr{s) 

t Supported by a Charles and Johanna Busch predoctoral fellow- 
ship. 

5 Supported Ijy nn Arthur McCallum predocioral fellowship, 
II To whom correspondence should be addressed. Tel.: 201-932- 
4081 or 20)-873-2752. 



(l-3)..Thr5 complex catalyzes the insertion of a double bond 
between carbons 9 and 10 of the saturated fatty acyl sub- 
strates, palmitoyl (16:0)- and stearoyl (l8;0)-CoA, yielding the 
monoenoic products palroitoleic (16:1) or oleic (18:1) acids. 
Although higher eukaryotes contain polyunsaturated fatty 
acids in their membranes, either synthesized endogenously 
Via A12 and A15 desaturase reactions or obtained from their 
diet, the A9 reaction accounts for all de novo unsaturated 
fatty acid (UFA)^ production in Saccharomyces cerevisiae (4). 

Isolation and characterization of fatty acid desaturase en- 
zymes has proved difficult due to their eKtraordinary hydro- 
phobic nature and tight association with membranes. Al- 
though fatty acid deaaturation was first described using the 
yeast desaturase system, only animal ^9 enzymes have 
been successfully purified to homogeneity (5, 6). At a genetic 
level, only the DNA sequence for the rat liver and mouse 
adipocyte genes have been reported and analyzed (7, 8). Those 
genes were found to encode proteins with 92% identical amino 
acid sequences. 

The A9 desaturase from rat liver has been most extensively 
characterized. It is a protein consisting of 358 amino acids of 
which 62% are hydrophobic (7). The functional enzyme has 
an obligate phospholipid requirement and contains one mol- 
ecule of non-heme iron (5). Effects of chemical modification 
on enzyme function has suggested that arginyl and tyrosyl 
residues are involved in the binding of the negatively charged 
CoA moiety of the substrate and in the chelation of the iron 
prosthetic group, respectively (9). A truncated rat liver A9 
enzyme missing 26 residues from the N terminus is also 
membrane-boun'a and functional (10). 

Yeast mutants bearing the del allele require oleic acid for 
growth and were believed to produce a defective 69 desaturese 
suggesting that OLEl v^as the structural gene encoding the 
enzyme (11). Recently, we isolated and characterized a yeast 
gjenomic fragment containing the OLBI gene of S. cerevisiae 
(12). Replacement of the wild type gene in haploid cells with 
a disrupted form of that fragment resulted in a UFA-requiring, 
nonreverting phenotype. 

In this paper we report the DNA sequence of the 5- cere- 
visice OLE! gene and compare the deduced amino acid se- 
quence of the yeast A9 fatty acid desaturase with that of the 
rat liver stearoyl-CoA desaturase primary sequence. Although 
the proteins encoded are highly divergent, the rat A9 desatu- 
rase gene functions efficiently in S, cerevisiae in place of the 
native yeast gene. Furthermore, predicted structural features 
of the two proteins suggest a model for the topology of the A9 
fatty acid desaturase in the ER membrane. 



* The abbreviations used are: UFA, unsaturated fatty acid; QRF. 
open reading frame; ER, endoplasmic reticulum; kb, kilobaaeCs). 
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MATERIALS AND METHODS 

DSA Manipulations, Meditx, and S(o>n— All tecombinant DNA 
manipuUUons were according to eundard methods (13, 43). Plastnid 
ampUfications and bactenel tranefortnations were pfirformed using 
either Escherichia coli fitjrain HBlOl or XLl Blue (Stratae«ne). Yeast 
transformations were by the method of Ito eta/. (14). Growth analysis 
was performed in synthetic dextro&a mediuni supplemented with the 
appropriate amino acids (23). The genotype of yea$t strain 1^140 
is; a. okJ£^'.:LEU2, ieu2-3. Ieu2-dn2, ura3'&2, hit4l - "\ . 

DNA Sequencing— Overlapping DNA frapuents lying within the 
OLEl Open readine frame were subcloned into pBluescript vectors' 
(Suatagene) In two orientations for «e<iuencing in either direcuon. 
Single-stranded DNA squencing templates were prepared by methods 
supplied by Stratagene. The Ml3(-20) primer waa hybridixed to ' 
eingle-stranded DNA templates and DNA sequencing waB performed 
by the dideo;cy chain termination method of Sanger ct oL (IS) using 
(he modified T7 DNA polymerase, Sequenase (U. S. Biochemical 
Corp.). In two casesi OLEl inurnal oligonucleotides were synthesized 
to facilitate DNA sequence analysis, 

DNA Sequence arid Deduced Primary Sequence Analysii'^OLE] 
DNA sequence and the deduced primary sequence analysis was per- 
formed using the Genetics Qompiit«ff Group (GCG) sequence analysis 
software package (16). Amino acid sequence of the rat liver stearoyl- 
CoA desatvrase was obtained from GenBank.* Primary sequence 
compariflon of the yeast and rat liver desaturases was performed 
using the BestPil analysis program. Hydropathy analysis was accord- 
ing to Kyte and Dooliitle (17). 

OiMtruction of Modifi^ole] AUcUa—KWeW^olcl-SS and olcl~207, 
containing stop codons in Ihe 5' region of the coding sequence, were 
conatrucud similarly. YEp352/OLE4.8 was partially digested with 
Sah or y^coli the cohe&ive ends were made blunt, end plaamids were 
religeted. Following amplification in E. coli, plasmid samples were 
RObjeci to restriction enzyme anolyflifi. CandidaUR lacking the rele- 
vant reatriction site were subject to DNA sequence analysis^for 
verification* 

Cowttrucllon of Epinomol and C<:nirvmenc Plofimtdj* Boarinfi ihx* 
Hat Liver Stearoyl*OoA Dcnaturtxc Gene— A 1.2 -Icb rat liver a9 de. 
wituruM- cDNA rragna-ni oiicoding roniduGK ;)-;J5ft, stop codon, and 
136 l>aBC pair? of the 3'-untrflnalated region was removed from 
piaemid pDsS'SSS (10) by digestion With BamH] end 5cc3 and in- 
eerted into the multiple cloning site of episomal piaemid yEp352. A 
l.O-kb yeast genomic fragment encompassing the promoter region, 
tranBlatlon in'ttiai-lon codon, and the first 27 codons of the OLEl wai; 
isolated via HtndIU/5a/l digestion and ligated in-frame with the rat 
desaturuwB ffagment in YEp352. In this final construct, an eight- 
codon linker derived from the multiple cloning site regions of pUC8 
and YEp352 scparous the yeast N- terminal codons from the rat 
deeaturase sequence. The predicted eise of the fUaion gene product is 
391 amino ncid residues. The yeast-rat fusion gene was then recovered 
via /f iftdIIl/Z?ral digestion and lig«t*d into YCp50 using W mdlll and 
Nru\ restriction sites. Plasmids bearing the ftision gene were ampli- 
fied in £. coli strain XLl -Blue and used to transform the yeast olel 
genenliBrupted strain LS-14C. 

Lipid halation and Fatty Acid AnC[(ys»>— Lipids were extracted 
from whole yeast cells by direct saponification (16). Fatty acid methyl 
estens were prepared by transmethylation with boron trifluoride (19) 
and analyzed by gas- chromatography using a 30-nieter capillary 
column SP-2a30 (Supelco) in a Hewlett-Packard 5710A chromaio- 
graph as previously reported (12). 

RESULTS AND DISCUSSION 

General Features of the OLEl Structural Gene— In a pre- 
vious report it was shown that a cloned 4.8-kb Hinditl yeast 
genomic fragment, but not two subclones of that fragment 
terminating at a central Kpnl region, complemented the olel 
mutation of S. cerevisiae (12). From that Kpnl junction, 
overlapping subclones were used to 'Svalk" through the ob- 
served open reading frame (ORF) in both directions yielding 
the sequence strategy presented in Fig. 1. Both strands were 
sequenced through the entire ORF without ambiguity. 

The DNA and deduced amino acid sequence of OLEl and 
flanking nucleotide sequence is shown in Fig. 2. The ORF is 
1530 nucleotides long. Translation of the entire ORF would 
produce a 510-aroino acid polypeptide having an approxi- 



mately molecular masfi of 57.4 kDa containing 49.2% hydro- 
phobic and 25.7% charged (10.0% acidic and 15.7% basic) 
amino acid residues. 'No consensus iV-glycosylation sites are 
present in the deduced amino acid sequence of OLEl and the 
protein does not appear to contain a cleavable N 'terminal 
signal-sequence. 

Yeaat TATA promoter elements are commonly found 40- 
120 base pairs upstream from transcription initiation sites 

(20) with an average mRNA leader sequence of 52 nucleotides 

(21) . The OLEl promoter region has two consensus TATA 
promoter elements (TATAAA and TATATA) located at po- 
'Sitione -80 and -156 relative to the ORF. A transcription 
initiation event, directed from the TATATA element located 
at -156, could yield a tz'anscript having features consistent 
with the above observations. However, transcription initiation 
directed irom the' TATAAA promoter element located at -30 
could result in an atypically short, nontranslated leader se- 
quence relative to the first in-frame ATG. Furthermore, there 
are three additional in-frame ATG codons within the first 
400 base pair/ of the OLEl ORF at positions 56. 61, and 116 
that could also serve as potential translation start sites (see 
Fig. 1). Due to the close proximity of the first ATG codon to 
the TATA promoter element at -30 and comparison with the 
rat desaturase (discussed below) that showed no significant 
similarities in the first 140 amino acids, we were prompted to 
test for functional OLEl products initiating from these down- 
stream sites. Two modified olel alleles were constructed (see 
"Materials and Methods") that shifted the ORF and intro- 
duced translation stop codons at either position 33 {olel -33) 
or 107 {olel-107)* Both in-frame stop codons were positioned 
before the next available ATG sequence. The olel gene- 
disrupted yeast strain L8-14Cf bearing the deletion allele 
olclS::LEU2t was transformed with either of the above alleles 
on an episomal plasmid and tested for the ability to grow in 
the absence of exogenous UFAs. (Strains bearing this okl 
allele were previously shown (12) to completely lack A9 de- 
saturaae activity as determined by product formation and 
have limited and finite growth potential (4-5 generations) in 
UFA-free medium.) In both cases the transformed strain grew 
only when UFAs were added to the growth medium, which is 
consistent with the first in-frame ATG codon functioning as 
the primary site of translation initiation. 

Yeast arid Rot Liver A9 Enzyme Amino Acid Analysis^A 
computer search of homologies to all current entries in 
GenBank/EMfiL protein data bases identified a single data 
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Fig. 1. OLEl restriction map and sequencing strategy. Pri- 
mary rfistriction sites mappine the 4.8-kb yeaet DNA fragment con- 
Ulning OLEl: Bg, B§1)1\ Bt, BstKlli Hindlll; Hp. Hpa\\ K, KpnJ; 
R, ^coRIi S, Sdllj 5m, Smol; P, PstX; and X. XhoL The position and 
direction of the ISSO-base-long ORF encoding the a9 enzyme is 
indicflt©d by the large arrow above the map. Small arrows below the 
map indicate by size and direction the OLEl subclones used to 
sequence the entire ORF and flanking regionA. 
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1 AC^CTCX^CA^ACC7TATtTAGTtKCCV^X^GCtCTCmCT^CCACTCTTGCTC (9 

<1 A«C>^ACCTWCCCTAnCttACCCCTMCCCCX7CCcicACA*A0GTGGAAXTM^ 120 

121 T«nA>«V^TATXT0CACCWOlTCXTC6GCTCCTaiCTC^^CWiCTC^ llO 

111 ATArAC^TftTfttftTftTGGC^(yiGATCTrCATTCmTATTOTTCTATTrCCATCTT^ 

a«l TAmCTCTTTCecrnWAmifiTATTi^lTAGAXTAGAACATCATACTAATACXTAO 300 

591 T7CT^;G1t;^TCATATX^lAA4CACCACrAAAA^>TTACAACAAAG^tGCCAACT^CI0CA JtfO 

tlfitPiraThr crGly 

3<1 XCTACTATTCMnCAlTCACCACCAATTTCtAAACCATCACTCTOCCAfiCAGTSCCMT 420 
ThTThrlUCl\)Uun«A*pA*p;lft?MProLy9A»pA3p5crAla*erSetCly2l« 

4 21 CTCCAeCAACTCCACTTAACO«AX6CTA\TAmtCCCTJ^CTTT«ya.TAAflAAACCA 4S0 
VAiAj pCluVal Aa pLouThrGXuAla AanlltiLwAlATh rClyUuAs fiLy cly aAl 4 

4ei CCAAQAATTOTAACMTTTTOCTTCTTTAATGGGCTCCAACCAAATCCTTTCCflTCOAA 540 
oArfl 1 lirValAsn01yPncaiyfierVruaatGlyifcrLy*CluafitV*15crV*lGlu 

541 TTCCACAAC^ACCC>fcWXCMA\C/ACWMmCCATCCTCTC £00 
P he A« p Ly f Ly B CI yA» nC iuly p tyj S cr Aan LcuA 5 pjtf gLru Leu Gl uLy 1^* J n 

601 CAA^yuAAAko^cv^CCTAiAACTWAATTCACATCTCCaAA^ £6o 
Cln6luLyflaiuCiuAi*Ly»ThrlyancHlBllc£crCliiGlnProTWhrUiiAin 



1141 CYCATTCOACTCrrATCTCtCCTTA Ci I i l i UA ACCACrArATCCgrCgTTTCATCTAT 1»00 

vUzletreTbrtcoUecy«ClyTyzPnePhaA»nAipTyinetclyClyU>ail6Tyr 



661 '^rGGCACCAACATTTCXACTCCTTCWATCCTTCTTQTTTGTGOTATCCCAATCATT IZO 
ABnTipHA»ClftHHUuX*riTrpUaAfnM£lV*ai*uVfllCyfClyHtttrOhCtXi« 



iJOl CCCCCTTTTAffTCOTOTCTTTCTCATTCAACXXCCTXCCTTTtCCATTAACTCCATWCT lfi«0 
AlAClyPhel laMr«VAUrkeVallleCiaClnAlAThrPh«Cy9l IcAantartV^tAl ft 



'21 CCTTCCTACtTCGCTCTCTCTOGTAAACTACCTTTCCATTTAAACCTTTTCCTmCTCC 7«0 
ClyTrpTyiPhcAlflUuae^CXyt-ysVjlPtoLouHiiLcuAinVilPhtLwPhoficr 



1Z<1 CAnACATCCGTACCC>ACCAnCO^TGACAC>JSC/ACCCCTCCtCACMCTCGATTACT U20 

M 1 iTy r 21 colyititftln? roY hcAopAapAr ghrgThrp r oM:gK» p^anTfipX 1 D r 
i>Z\ CCCATTCTTACrnCCCTCAACOTTACCATMCTTCCACCACCMTTCCC^ 1390 

^361 ACAAACGCTATtAMTCCTMXAATACCXCCCMCTAAGCTTATCAtCTATTTC^ 1449 
A I g KanAl a 1 1« X^rjTrpTyr d itTyr AipP roThr Ly * V« 1 1 Lc 1 1 « t> rl^uThrS M r 

1441 nAflTTCCTClACCATACCACTTCA\C>jJ^nCTCTCAAijiTCCTATTC>AC^ 15 CO 

LeiiVa IczlyLdoMaTyrAapLcuLyi Ly iPhcS crGlnAj ^AXa II oGluCluAia LcU 

ISOl AmAACAAGAACAAAACMCATCAATAAAAACMCracrAACATTAAn 1^«0 
' iltCInClnGluGlnLy«l^JXleAanL7a]tf<LysAlai.y*2lcA3nTrpClyProVjii 

1561 TTt^CTCATWoCCAATGKCCACAAACAAACCnCTTIIOCTAACTCTAAGGAAAACAAQ 1^20 

LvuThrA) pLquPrcMtTrpA'pLysGlnttirPh* t«uAX a Ly» S< r i-yaclbAtnLy s 

U2\ CCTTTCCTTATCATTtCtCCTATTCTTCACCACCTATCTGCTtATATCTCTWACATCCA UtO 
ClyUvValXliiXl49»cCXy;leValMi«A«rValsaT&lyTyr;ieE«rCluNloPro 

1«»1 CCTCCTGAAACnTAATTAAAJWTGCATTACCTAACGACCCfACCAAGGC^ 1740 

ClyclyolxiTKirUuX loLyaThrAl* UuClyl^ya A*p/daTnt Ly 8Al A?hc5c rts ly 

174) CCTCTCTACCOTCACTCAAATaCCOCrCAAAATCTCTTGGCTCATATaAGAGTCC^^ I •00 
ClyVAlTyrAr9HLB3ttrAfnAl&AlaGlhA*nV»lUuMaAlpHocArgVilAl»Vil 

IBOl ATCAAOaAAACTAACJU«:TCt0CtAITAcJ^TCCCTACTAACA0AGCTCAAA7CTACG^^ 1860 
21cl^9CliUBrlyiA«nScrAijntAr9>ictAlaS6rtyfAr(]CiyClUllcTyrClU 



?M CTTTTCTACTACCCTCTCCCTCCTC7TTCTATTACTCCCa£irrACCATAGAT7ATCCTCT l/S 
ValPhcTyrTyfM.ValOiyClyV»lS«rlltThrAXlClyTyr«iaAr9LcuTcp5»r 

t4l CACACATCTtAeTCCCCnt>CTCIICCATTCACA7TATM «0 
Hi»Are5»rTyr£orAUHi.oTrpProL«uAr9ZAiuPh<tlyrAiftXlePhoCJ.yCyBAla ^ 

501 TCCCTTCAA£WCKCCCTAAATOCTO<WCCCACTCTCACACAATTCACCATCCTTACACT 9fi0 
3-)rv*laluCly3crAl4Ly»TrpIrpaiyHliSerHlaArgn*KijHiiArqTyr7hr 

»fil JATACCTTC/XACAtCCrTATCACCCTCCTACACCTCTATQCTACTCCCACATCCCATCC 1020 
.VfpThtUuArgAipPrDTyrAapMaAr^ArflClyUniTrpTyrSerMtaKtflClyTrp 

102] ATCCTmCAAGcCAAATCCAAAATACAACCCTACACCTGATATTACCCATATCACTCAT 14 tO 
Ho t LoU u Ly a P r OA a nP roLy aTy r Ly» Al AAr q Al « Aap 1 1 cjh r A ■ (**CtTh r A ap 



1 0« 1 CATTGGACC ATTACATTCCAACACAOACACTACATC7TCTTCATCTTATTAACCCCTITC 1 1 4 0 
AiptrpThrIUArqPh»Cinni»ArqHAaTytllrLt:Ul«U«fitLCUUttrhtW»Ph« 



USl AC7GCTAAaTTCTTrrAACCATCACATTACAATAACAAAACtCCAACTACCATTAAMAA 1920 
ThrGlyLy*Phc?nrEnd 

1911 AAATTCAAAAATCATAAATTAAAAAAAAAVUkATCAAnCAATTTTT^ 1540 

1963 ACCTTTTCACATTmCC T ' j 1 1 1 1 1 tyCTCTTATTACCATTTACC J 1 tTTTATTTATTTt 204 0 

20*1 TTtCATTTTJ^ATTTTATicnCOTATrrATCTATAGAAATTTTCAmTCATrTACA 2100 

ZlOl TTCAGAnTCCTTATCWTTTTCATTATAtATCTTTTEICACTAACTTTCAeCtfAACTTC 21 CO 

21 «1 7ATTrmATT r T l 1 1 H TCTCCCCCCTGCACCAATACATATCCCATGCCTyACTCCATg J2I0 

Zttl 7CTTCAAAATTTCACACTCATGCTCACCCTTAACnCTCAACCTTTT 12«1 



Pic. 2. Nucleotide and encoded amino acid t»qU6nce of tho A9 fatty acid d&saturaso structural giene, 
OLEL Two conseneuB yeast TATA elementis preceding tbft ISSO-base-long ORF and the first four in-frame 
tnethjonine-cpecific codons Ani ond^rliMiL An OLEl internal region of 268 amino acids displaying Bignificant 
identity to the rat liver ^9 ensyme is delimiud by asterisk^. Potential membrane-spanning regions are highlighted 
with lines above nucleotide and amino acid sequences. 



base entry, the rat liver A9 desaturase, with significantly 
similarity to the OLBl gene product (Fig. 3). The aligned 
Bequencee show 36% identity and are greater than 60% similar 
over the region encompasBing the C-terminal 260 aruino acids 
of the shorter rat protein. No significant einiilatities exist 
over the N -terminal 141 amino acids of the yeast and 99 
amino acids of the rat sequences. The yeast open reading 
frame extends 113 amino acids beyond the C-terminal end of 
the rat sequence. Within the region of high amino acid simi- 
larity there are three segments, having a minimum length of 
10 residues, of very high identity (>70%) beginning at OLEI 
amino acid positions 156. 331, and 368. The most highly 
conserved of these is the first region where 17 of 23 identities 
are observed including one stretch containing 11 of 12 perfect 
matches. 

The most conserved amino acid type within the compared 
repion of the yeast protein ie histidine with 10 of 14 (71.4%) 
residues in perfect alignment. Two other amino acid residues, 
proline and arginine, also show greater than 50% total iden- 
tity. Arginine residues of the rat liver enzyme have been 



yC«9t 141 VrLTSVrtYAVGOVSXTACYHittJHSHRSrSAHk'VtAIJTAXrCCHSVeCS 190 

m 99 7U>fGiPTyuaALGIT%auMiratlnnlgwIi'^UPL7IA}n>t^ro^ 14« 

y««>t 191 AmHcstioaRURrnmjtmOMtPGurTSiai^^ 23b 

r4t 149 vyfHAK>H|OUllUCrSCTHAPPHrt&fUCrrrSKVQ<UVW(^ ISI 

ycdat 235 ,DITDKIW*rr>WWRKriuifl<ttArvipTMCCV^ ?96 

t:.l:..f. :.||:l.f Ml I w m . 

rat 1» UMSDUWOOM^POfaunrRPGUAHCrXXfTl^^ 249 

.Itl r: .(I- ^tt tl..| .IH.. .|;| 1 1 1 : 1 1 

cal 249 WlAYTL^n>UTWlA«BAAI&«yaPYDIM0£A£KILV5^G5VWC^ 29B 



yeast rHiiay7T0'fnu:nanqtu9V^iTtLTiwcUittiim U6 

HI. II II... l..-.t . ;lltll ll.|..t: I t 

rat 299 YJiKAn'nyuficymaxiarrrrPiDawAiaAYbiucxvsiwiv.t^ 947 

y^sn 397 oocQKXTwaicK JJl 

f% 34i KRICPCSURSS 3SI 

Fig. 3. Amino aotd sequence comparison of the yeast and 
rat liver ^9 fatty acid desaturases. A 257'refiidue internal region 
of the yeast A9 enzyme is aligned with thv rat liver stearoyl-CoA 
desaturase. Comparison was prepared by the GCG sequence analysis 
program BestFiL Identical residue matches are indicated by connect- 
in^ solid lines. Two or one point between residues indicate decreasing 
amino acid similarity. Percent bimilarity value ifi based on the number 
of identical and twu-nolnt amino acid ftomDoriRonB. SftRmenta fihftwine 
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Fic. 4. Hydropathy analysis of yeast and rat liver A9 enzymes. Aligned Kyte-Doolittle (17) hydropathy 
profiler of the yeast and rat liver 1^9 desaturase proteins. The presumptive double nnembrane -spanning wqaences 
are indicnied by bold Unt^s above ihow repions. 



(9). significance as e highly conserved amino acid supports 
thia finding. Although the role of histidyl residues in the fatty 
acid desaturafiCB has not been cxanilned, their highly con- 
served'appearance also suRRefitfl an imporunt contribution to 
enzyme function. 

Structural Analysbt and Proposed Topolojly of the V^o-''^ 
F.n7ym(' - fitriJtinjr fiimilarities were alRo observed in the hy- 
dropathic choracUriatics of the two enxymcB (Fig. 4). Both 
proteins contuin two lonK hydrophobic reRionK (-50 refiidues) 
that could potentially form two membrane traversing loops, 
each consi.sting of two tranfimembrane fiepmentft. Chou-faa- 
man alRorithme; predict ^-turn forming potential in the central 
portions of each loop in both the yeast and rat liver proteins. 
Inapcction of the primary fccquences at those cites reveals the 
presence of multiple helix-breakinp; amino acide that could 
serve to disrupt a-helical structure in order to form the looped 
structures. Theae hydrophobic regions are at identical posi- 
tions in the aligned yeaflt and rat sequences. At least three 
smaller hydrophobic recions (each <7 amino acids) are also 
found at identical pusilian-s in Ihe (wn proteins. The repion.s 
of hinh consecutive amino acid identity, however, arc not 
wiii\in U»e hyilruphobic sequences. The fir.st rcpion is located 
between the' two *'tr8nfimembrane loop regions" the second 
and third identity regions ere located at the C-terrninal part 
of the protein past the second *'tTanamembrane loop." Neither 
extension of the N- and C- terminal domains of the yeast 
appears significantly hydrophobic and an examination of the 
amino acid distribution in those regions further suggests that 
they do not contribute to the integral membrane domains of 
the protein, A proposed model of the topology of the yeast 
protein in the ER membrane is given in Fig. 5. Assuming that 
the membrane-spanning regions are confined to the predicted 
hydrophobic sequences that are greater than fiO amino acids 
long, the arrangement places most of the protein on the 
cytoBolic side of the ER membrane. Furthermore, all throe 
rejiionR of hiirh roni^ecutive identity would be located on that 
side of the membrane which is consistent with its proposed 
site of action (22). 

Growth and Fatty Acid Content of Gene-disrupted Yeast 
Transformed with the Rat Liver A9 Desoturose— The signifi- 
cant sequence and predicted structural similarities observed 
between the yeast and rat A9 proteins prompted us to test 
whether the rat enzyme could functionally replace the yeast 
enzyme in S. cereuisiae, although there are additional residucB 
at the N. and C^terminal ends of the yeast peptide sequence 
that are not found on the rat protein. A yeast-rat f\ifiion gene 
was constructed (see "Materials and Methods") placing co- 
dons 3-368 of the rat gene in-frame with the initial 27 codons 
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Fig. f). Model for the orientation of the yeast desaturase in 
the ER membrane. The numbers identify amino acid positions in 
the yeast open reading frame. 

of the yeast gene and promoter sequences separated by an 8- 
codon linker region. This fusion gene was placed on a multi- 
copy epifiomal and single copy centromere-based (CEN) vec- 
tors and introduced into the olal gene-disrupted yeast strain, 
L8-14C. Fusion gene transformants were analyzed for growth 
and lipid composition relative to the seme gene-disrupted 
strain transfoKned with the plasmid bearing native OLEl 
gene. 

Yeast transformants bearing either the native OLBl or the 
yeast-rat fusion desaturase gene (two isolates) on an episomal 
plasmid were found cured of the UFA requirement and, sur- 
prisingly, showed identical growth rates (Fig. GA) indicating 
significant conservation of A9 desaturase tertiary structure 
and an ability of the rat enzyme to interact with the yeast 
cytochrome b.v In addition, because the rat protein is 113 
residues shorter than the yeaet desaturase at the C-terminal 
end and yet can functionally substitute for the yeast enzyme 
in S. cerevisiae, it appears that this extension of the yeast 
protein may be nonessential for catalytic functions. We can- 
not exclude the possibility, however, that the additional resi- 
dues may be involved in other functions that influence its 
catalytic efficiency or optimise interactions with other com- 
ponents of the desaturase system. 

An analysis of stationary phase cellular lipid compositions 
revealed, however, significant differences in the percentage of 
16-carbon fatty acid species in the yeast-rat fusion gene 
transformants relative to the wild type control and, as a result, 
a modest decrease in the percent total UFA (Table I). The 
lower percentage of 16:1 and increased 16:0 species found in 
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Table I 

Fatty acid composition of transforrrud S. cereuisiae 
Stationary phase L8'14C cells transformed with OLE! or the yea^t- 
rflt chimeric gene on multiple (epieomal) or single (CEN) copy 
number plasmidi were harvested and cellular lipidK anolyxod as 
described under "MatarlaU and Methode^^ 
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those strains may reflect a preference of the rat A9 enzyme 
for the 18:0-CoA substrate over l6:0*CoA. 

Although a yeast-rat desaturase fiaaion gene is capable 
of functionally replacing the native OLBl of S. cerevisiae 
when present on a high copy number plasmid, a more strin- 
gent test of the efficiency of the rat protein in yeast would be 
to examine cells transformed with a Single copy of the fusion 
gene. Cells containing the chimeric gene on CEN plasmid 
YCp50 showed growth rates that are reduced approximately 
65% relative to wild type (Fig. 6B). 

Sjmilarly> the lipid composition of CEN plasmid-bearing 
yeast transformants differed markedly between those contain- 
ing the chimeric gene and those containing the cloned yeast 
gene (Table I). The relative UFA levels were reduced approx- 
imately 38% in cells containing the rat gene coding sequence 
and the compensatory relative increase in saturated fatty 
acids resulted in a doubling of the 16:0 content and increased 
14:0 levels, but no significant change in the level of 18:0. 
Thus, the yeast-rat AS desaturase fusion gene can functionally 
replace the native OLEl of S. cereuwioe, although its action 
results in striking differences in cellular fatty acid composi- 
tions. 



In previous studies using gene disruption and lipid analyt- 
ical methods (12) we provided evidence suggesting that the 
OLEl gene encoded the yeast A9 fatty acid desaturase. The 
deduced OLEl amino acid sequence and physical comparisons 
of the yeast and rat liver proteins given here provide further 
proof that the OLEl locus contains the authentic structural 
gene for the desaturase. The aligned regions of consecutive 
identity between these two proteins from widely divergent 
sources suggests that they may represent conserved regions 
with similar function. The finding that the rat A9 fatty acid 
desaturase gene can complement OLEl in S. cenvisia^ al- 
though the two proteins have only 36% identity suggests that 
there is conserved functional interaction among cytochrome 
66-mediated desaturase systems. Thus, BR-bound A9 enssymes 
from other organisms and possibly other cytochrome bs-me- 
diatcd desaturas^s, such as the A12 and A15, may also func- 
tion in yeast. 

Achnowl6df;mcnt^'We wish to thank Philipp Strittmatter for plae- 
mids conUining the rat stearoyl-CoA desaturase gene. 
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An increasing number of plasma membrane proteins have been 
shown to be attached to the membrane via a glycosyl- 
phosphatidylinositol (GPI) moiety. All eukaryotes share a highly 
conserved GPI-core structure EthN-P-Manj-GlcN-PI, where 
EthN is elhanolamine. We have identified a protein encoded by 
the yeast open reading frame YGL142C that shares 33 % identity 
with the human Pig-B protein. Deletion of this essential gene 
leads to a block in GPI anchor biosynthesis. We therefore named 



the gene GPI 10. GpilOp and Pig-B are functional homologues 
and the lethal deletion of GPUO can be rescued by expression of 
the PIG'B cDNA. As found for PlG-B mutant cells, gpilO 
deletant cells cannot attach the third mannose in an linkage 
to the GPI core-structure intermediate. Overexpression ofGPIIO 
gives partial resistance to the GPI -synthesis inhibitor YW3548, 
suggesting that this gene product may affect the target of the 
inhibitor. 



INTRODUCTION 

Glycosylphosphalidylinositol (GPI) anchoring represents a 
mechanism for attachment of proteins to membranes found in all 
eukaryotic cells [1 ,2]. All eukaryotes share the common GPI core 
structure EthN-P-6Mana- 1 ,2Mana- 1 ,6Mana 1 ,4-GlcNa- 1 ,6- 
myo-inositol-P-lipid, where EthN is ethanolamine. Modifications 
of this core structure can occur on different mannose residues 
and depends on the species as well as on its developmental stage. 
The lipid moiety of the GPI anchor is also subject to modi- 
fications, such as variations of the fatty acids, ceramide 
remodelling or the attachment of alkyl chains [1]. 

The complete GPI precursor, whose structure has been solved 
for some species, is assumed to be assembled by sequential 
addition of the sugar components and ethanolamine to 
phosphatidylinositol [3], The complete precursor is transferred 
en bhc to the anchor-attachment site of the protein (t£>-site) with 
concomitant release of a C-terminal peptide [4]. Anchor at- 
tachment is thought to occur as a transamidation reaction [5,6]. 
A protein to be GPI-anchored carries two signal sequences: a 
cleavable N-terminal signal sequence that causes the protein to 
be translocated into the lumen of the endoplasmic reticulum 
(ER), and a C-terminal signal sequence that directs attachment 
of the GPI anchor. The consensus sequence for GPI-anchor 
addition consists of an amino acid with a small side-chain at the 
w-site, two amino acids with small side-chains in positions w-H 1 
and w+2, and a short hydrophilic spacer region followed by a 
hydrophobic domain of about 15-20 amino acids [7], 

The identification of at least eight complementation classes of 
mutants from murine lymphoma and human K562 cells, lacking 
the expression of GPI-anchored proteins at the cell surface, led 
to the characterization of genes involved in mammalian anchor 
biosynthesis [3]. Three complementation classes. A, C and H, are 
blocked in the first step of precursor synthesis, which is the ad- 
dition of GlcNAc to PI [8]. Class J mutants and class L mutants 
are defective in deacetylating GlcNAc-PI to give GIcN-PI [9-1 1). 
Class E mutants are defective in dolichyl (Dol)-P-Man synthase, 



thereby affecting GPI mannosylation and N-linked glycosylation 
[12]. Class B mutant cells are blocked in the addition of the third 
mannose to the intermediate structure [13], the class F mutation 
affects the transfer of the terminal ethanolamine to the third 
mannose [12], and class K mutants accumulate the complete 
precursor without transferring it to the protein [10]. Whereas in 
mammalian cells, several cDNAs that encode components of the 
GPI synthesis machinery {PIG- A, PIG-H, PIG-L, PIG-F, PIG-C 
and PIG-B) have been cloned [1 1,13-17], only genes involved in 
the first step of anchor synthesis (GPII, GPI2, GPI3/SPTI3) and 
in anchor attachment {GAAI, GPJ8) have been identified from 
yeast [18-21]. Yeast mutants that affect various steps of GPI- 
anchor synthesis have also been described [22]. 

The core structure of the GPI anchor contains three mannose 
residues which are all assumed to be transferred from Dol-P- 
Man as donor [23]. The mannose residues were found to be 
attached in different linkages, suggesting that at least three gene 
products are involved in GPI mannosylation. The Pig-B protein 
is required for the transfer of the third mannose in an a-1,2 
linkage to the core structure, but to date, no enzymic activity of 
the Pig-B protein has been shown [13]. A yeast mutant, blocking 
a similar step as the Pig-B mutant, has been isolated [22]. 
Addition of the third mannose can also be blocked using a 
species-specific inhibitor, YW3548 [24]. YW3548 blocks GPI- 
anchor synthesis in mammalian cells and in yeast, but not in 
protozoa. 

In this study, we report the identification of a yeast gene, 
GPIIO, which appears to be the functional homologue of the 
human PIG-B gene. In order to study the function of this 
essential gene, we placed GPUO under the control of the 
GALl /lO promoter. When GPUO expression was turned off, we 
found that inositol was no longer incorporated into proteins and 
that the maturation of Gaslp, a major GPI-anchored protein 
from yeast, was strongly reduced. Under these conditions, we 
observed the accumulation of the GPI intermediate Man^-GlcN- 
(acyl)PI, and of a novel yeast GPI -derived structure that was 
sensitive to treatment with HF and to Jack bean mannosidase, 



Abbreviations used: ER. endoplasmic reticulum; EthN, elhanolamine; GPI, glycosylphosphatidylinosito!: GU. glucose unit. 
' To whom correspondence should be addressed. 
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most likely Man-{EthN-P)Man-GIcN-(acyl)PI. The appearance 
of this novel lipid was blocked by addition of YW3548. a GPI- 
synlhesis inhibitor. Overexpression of GPIW rendered wild-type 
cells partially resislant to YW3548. 

MATERIALS AND METHODS 
Strains and growth conditions 

Cells were grown to saturation in either YPG [3 % (v/v) glycerol, 
2 % (w/v) peptone, 1 % (w/v) yeast extract, 40 mg/I each of 
adenine, uracil and tryptophan] or in SO YE [3 % (v/v) glycerol, 
0-67 % (w/v) yeast nitrogen base, 0.2 % (w/v) yeast extract and 
the required nutrients] and used to inoculate the glucose- 
containing media SDYE [0.2% (w/v) yeast extract, 0.67% 
(w/v) yeast nitrogen base and 5% (w/v) glucose] or YPUAD 
[1.0 % (w/v) yeast extract, 2 % (w/v) peptone, 5 % (w/v) glucose 
and 40 mg/l each of uracil and adenine]. For overexpression 
experiments, cells were grown in YGal medium [2% (w/v) 
galactose, 1 % (w/v) yeast extract, 2 % (w/v) peptone, 40 mg/l 
of each of adenine, uracil and tryptophan]. Sporulation of yeast 
strains was on minimal sporulation medium (1 % potassium 
acetate, 2% Bacto-agar, and the required nutrients to comp- 
lement auxotrophies). 

The full-length sequence of G PI 10 was replaced by the Kan MX 
module using a PCR-based strategy [25]. A knockout cassette 
was constructed using a short flanking homology strategy and 
the oligomers: (upstream) 5' TCTCTCAACGATAGGGTCT- 
GATTTATTAATTTTACTACTGCCGAAACGTACGCTGC- 
AGGTCGACC 3' and (downstream) 5' ATCGATGAATT- 
CGAGCTCGTTATGGTAAGTTAATATCGCTATAAGGT- 
CGCCGTCTATAAATTTAAA 3'. 

This cassette was transformed into the diploid strains FY 1679 
and CEN.PK2. Deletion of the gene was monitored by resistance 
to geneticin (Gibco-BRL) and resistant colonies were confirmed 
by PGR analysis. For overexpression studies from the GALl 
promoter, RH1657 was used. For galactose-depletion studies, 
RH3699 was transformed with pGal-GPIlO and tetrads were 
dissected, generating spore A (RH3994), spore B (RH3995), 
spore C (RH3996) and spore D (RH3997). For in vivo mannose 
labelling experiments, RH3998 and RH3999 were used. All yeast 
strains used in this study are shown in Table 1 . 

Plasmids 

pGal-GPIlO was constructed by PGR amplification of the full- 
length GPIW coding sequence from genomic DNA, with the 
primers (upstream) 5' GATAGTGTAGAATTGTGTAATG 
CAA y and (downstream) 5' GTAAACTGATAAGTGAAG- 
CATGGAC 3', using PFU Polymerase (Stralagene). The PGR 



fragment containing sites for XbaX (upstream oligomer) and Sph\ 
(downstream oligomer) was subcloned into the Xba\-Sph\ site of 
the pSEYG68 vector containing the GALl/ 10 promoter cloned 
into the EcoKl-BamWX site (referred to as pGal). 

p425-PlG-B was prepared by subcloning the full-length PIG- 
5cDNA [13] into the EcoKW site of pBluescriptll (Stratagene), 
and the fragment generated by cutting with Spel and Sal\ was 
li gated into p425 digested with Spe\ and Sa!\. 

[^H]/nyo-lnositol labelfing of proteins 

Wild-type cells or gp/70A::KanMX cells carrying pGal-GPIlO 
were grown in SGYE to saturation and used to inoculate SDYE 
(5% glucose) in which they were grown for about 16 h. Gells 
(5 X 100 were washed twice in SD-inositol medium (5 % glucose) 
[26], resuspended in 500 p\ of SD without inositol (5 % glucose) 
and depleted of inositol for 10 min before the addition of 15 fiCi 
of ['H]m>'o-inositol (Dupont de Nemours, Bad Homburg, Ger- 
many). Cells were labelled for 30 min. A total protein extract 
was prepared by lysing the cells in TEPI [100 mM Tris/HGl, 
pH 7.5/10 mM EDTA/proteinase inhibitors (1 /tg/ml pepstatin, 
1 /ig/ml leupeptin, 1 /tg/ml antipain)] by vortexing for 4 x 1 min 
with glass beads. Total protein was precipitated with trichloro- 
acetic acid and the precipitates were washed with acetone 
and resuspended in Laemmli protein sample buffer. The proteins 
were separated by SDS/PAGE (10% acrylamide) and, after 
incubation of the ge! in 1 M sodium salicylate, the radioactivity 
was detected by fluorography. 

Putso-chase analysis of Gasip and CPY 

Pulse-chase labelling and analysis of immunoprecipitates was 
done as described previously [26]. Briefly, wild-type cells and 
g/7/7(?A::KanMX cells carrying pGal-GPIlO were grown to 
saturation in SGYE and shifted to SDYE (5% glucose) for 16 h. 
Cells were harvested and washed with SD* [5 % (w/v) glucose, 
0.67% (w/v) yeast nitrogen base and the required nutrients]. 
Cells (2.5 X 10' per time point) were resuspended in 1 ml of SD*, 
preincubated for 10 min, labelled with TRANS "S-label (Dupont 
de Nemours) and chased for the indicated times with 0.003% 
(w/v) methionine/0.003% (w/v) cysteine/3 mM (NH^)2SO,. 
Aliquots were taken and the chase was stopped by adding NaN, 
and NaF to a final concentration of 10 mM. Cells were lysed 
by vortexing for 4 x I min with glass beads in TEPI. The lysates 
were boiled in the presence of 1 % SDS for 5 min and centrifuged 
for 1 5 min in an Eppendorf centrifuge. The supernatant was 
added to 5 ml of TNET (100 mM Tris/HCI, pH 8/100 mM 
NaCl/5mM EDTA/1% Triton X-100) and the extracts were 
incubated with polyclonal antisera against Gaslp or CPY and 



Table 1 Strains of Sacchsromyces cerevlsiae that were used for this study 



Strain 


Phenotype 


Source 


FY1679 


Mata/a ura3'52/urB3'52 kalM + frp;A63/ + his3A200/ + 


Eurolan 


Cen.PK2 


Mala/a Ufd3'52/ura3'52 leu2-3, 1 Wleu2-3. J 12 trp1-289/tTph289 his3A 1/tiis3A 1 


Eurofan 


RH3698 


FY1679, but gpi10A::K2mX/GPno 


This Study 


RH3699 


Cen.PK2. but gpiW^vMmVGPiW 


This study 


RH3994 


Mata ura3 S2 /eu2-3. 1 12 tfpl-289 fjts3A pGal-GPM 0 


This study 


RH3995 


Mata 5rpf70A::KanMX ufa3-52 ieu2 3.112 trpl-289 fJis3A pGal-GPIIO 


This study 


RH3996 


Mata ur33-52 /eu2-3. 1 12 trpl-289 his3A p6al-6PM 0 


This study 


RH3997 


Mala ffpf70A::KanMX uf33-52 Ieu2 3.112 trpl'289 his3A pGal-GPIIO 


This study 


RH3998 


Mata pmi40 ufa3 leu2 ade2 trpi tiis4 pGal-GPIlO 


This study 


RH3999 


Mata pnHO gpilOAy.KunUX Ufa3-1eu2 8de2 trpi his4 pGal-GPIIO 


This study 


RH1657 


Mata hts4 1eu2 uf33 fys2 barl 


Laboratory strain 



yeast homologue ol human P/G-B 155 



Protein A-Sepharose (Pharmacia, Uppsala, Sweden) for 3 h at 
room temperature. The immunoprecipitales were washed four 
limes with TNET, once with 20 mM Tris, pH 7.4, resuspended in 
Laemmli sample buffer and analysed by SDS/PAGE, followed 
by exposure and qyahtification of the gel on a Phosphorlmager 
(Molecular Dynamics, Sunnyvale, CA, U.S.A.). The percentage 
of mature Gaslp was determined as the percentage of the total 
Gaslp signal found in the 125 k Da form. 

Gtycollpid labelling in vivo 

["QMannose labelling experiments in vivo using RH3998 and 
RH3999 were performed as described (27). In brief, pmi40 or 
pmi40 gpiJOJ^iiKanMX cells carrying the plasmid pGal-GPIlO 
were grown for 16 h in SDCU medium [5 % (w/v) glucose/ 1 % 
(w/v) peptone/ 0.67% (w/v) yeast nitrogen base/0.1% (w/v) 
mannose, supplemented with 40 mg/1 uracil]. Cells (3 x 10') were 
resuspended in SPCU medium [0.1 % (w/v) glucose, 2% (w/v) 
pyruvate, 0.67% (w/v) yeast nitrogen base and the required 
nutrients], preincubated at a non-permissive temperature for 
pmi40 for 20 min and labelled with 25 fiCx of ["CJmannose for 
45 min. In some experiments, cells were incubated with 5 /iz/ml 
YW3548 (Novartis AG, Basel, Switzerland) for 10 min before 
addition of the radioactivity. The reaction was stopped by 
the addition of 10 mM NaF/10 mM NaNj, and lipids were ex- 
tracted with CHCI5/CH3OH/H2O (10 : 10: 3, by vol.). Lipids were 
desalted by phase partitioning between n-butanol and 0.1 mM 
EDTA/5 mM Tris/HCI, pH 7.5, and analysed by TLC using 
CHCl3/CH30H/H,0 (10:10:3, by vol.) as solvent. For struc- 
tural analysis, the accumulated lipids were extracted from the 
TLC plate, converted into hydrophilic fragments or core glycans 
and analysed by Bio-Gel P4 or high-pH anion-exchange 
chromatography respectively, as described previously [28,29]. 
The hydrophilic fragment generated using HNOj was digested 
with Jack bean mannosidase (Sigma) for 40 h with 2 x 1.5 units. 
The resulting fragment was desalted and purified on a G-15 
column, then analysed on Dionex as above. 
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Figure 1 Sequence alignment of GpilOp and the Pig-B protein 

The sequence alignment was done using the BESTFIT program of the Wisconsin Package 
Version 9.0. Genetics Computer Group (6CG), Madison. Wl, U.S.A. Identical residues are shown 
In twxes. 



Halo assay 

RH1657 cells expressing either pGal or pGal-GPIlO were grown 
to saturation in YGal medium. Cells (5 x 10*) were included in 
10 ml agar plates (0.8 % agar) of the same media and 4 /i\ of 
YW3548 at different concentrations was spotted on the plate. 

RESULTS 

Identification of GPI10 

A database search using the National Center of Biotechnology 
Information BLAST server indicated a large extent of homology 
between the protein encoded by the yeast open reading frame 
YLI42C and the human Pig-B protein. Owing to the results 
presented below, we named this gene GFIJO. The GPIIO gene 
encodes a protein of 616 amino acids with a predicted size of 
about 72 kDa. It shows 33 % identity with and 58 % similarity to 
the human PIG-B gene (Figure I), the potential third mannosyl- 
transferase producing the GPI anchor. In addition, the hydro- 
phobicity profiles of the two proteins are very similar (results not 
shown). 

GPI1Q is essential and its deletion can be rescued by expression 
of the human PIG-B cDNA 

To test whether GPIIO is an essential gene, we constructed a 
KanMX knockout cassette and transformed it into the diploid 



wild-type strain Cen.PK2 to disrupt the GPIIO gene [25J. Upon 
tetrad dissection, we found that only two spores from each tetrad 
could grow (Figure 2A). The growing spores were sensitive to 
geneticin, showing that spores that were deleted for GPIIO were 
inviable. GPIIO was also found to be essential in other strain 
backgrounds (results not shown). 

To study the function of the GPIIO gene product, we con- 
structed a conditional mutant of GPIIO by cloning the gene 
under control of the GALl/10 promoter on a centromeric 
plasmid. In this way, the expression of GPIIO can be turned off 
simply by shifting the cells from glycerol-containing medium 
(SGYE or YPG), in which the promoter is not repressed, to 
glucose-containing medium (SDYE or YPD, both containing 
5% glucose) where it is. The Gal construct of GPIIO (pGal- 
GPIIO) was functional and could restore growth of 
^/7i76>A::KanMX spores (Figure 2B), This construct was used to 
perform GpilOp depletion studies. 

GPIIO had been identified based on its high degree of homology 
to the human PIG-B gene. We therefore tested whether GPIIO 
could be functionally replaced by PIG-B. PIG-B was cloned into 
a yeast multicopy vector (p425) behind the MET promoter, 
which is activated in the absence of methionine. p425-PIG-B was 
transformed into the Cen.PK2 diploid strain which is het- 
erozygous for gpi I OA: .KanMX. The diploid was sporulated on 
minimal sporulation plates and tetrads were dissected onto 
minimal medium plates without methionine. Despite low overall 
spore viability, we obtained spores that carried the plasmid and 
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Figure 2 GPI10 Is essential for growth and Its deletion can be 
complemented by pGal-GPIIO or by p425-PIG-B 

(A) The GPnoc^ reading frame was replaced by KanMX in the diploid Cen.PK2 and tetrads 
were dissected. (B) pGal-GPIIO was transformed into RH3699 and tetrads were dissected. 
Growth on medium containing geneticin was used to monitor the deletion of GPHO. (C) p425- 
PIG-B was transformed into RH3699 and tetrads were dissected onto minima! medium, the 
growth of two spores (A, B. C and 0) is shown: spore A was dismpted lor GPIIO, as monitored 
tjy growth on medium containing geneticin. whereas the wild-type spore, 8. cannot grow on 
genetictn-containing plates. 



which were resistant to geneticin (Figure 2C), suggesting that 
they were deleted for GPIIO. PIG-B expression did not completely 
restore wild-type growth levels in gpilOA : : KanMX strains [com- 
pare spore A (gpiJOA: iKznMX) and spore B (wild type) in 
Figure 2q, but partial restoration of phenotypes is commonly 
seen for functional complementation over such a large evol- 
utionary distance. Expression of PIG-B from a centromeric 
vector was not sufficient to restore growth (results not shown). 
We conclude that expression of the human PIG-B cDNA can 
complement the lethal phenolypc of ^p/70A:: KanMX cells, 
suggesting that the two proteins are functional homologues. 

GpHOp depletion results In a GPI-synthesis defect 

The GALI/lO-controlled GPIW vector allowed us to regulate 
the expression of GPIIO. To find conditions where GpilOp was 
functionally depleted, we followed the growth of the spores of a 
tetrad (Figure 2B) upon shift from glycerol- to glucose-containing 
medium. As shown in Figure 3(A), the growth of 
^;?/7(?A:: KanMX cells slowed down significantly after about 8 h 
incubation in glucose-containing medium, suggesting that 
GpilOp was strongly depleted. For the following experiments, we 
used conditions where GPIIO expression was turned off for 
8-16 h. We had also tried to shift cells from galactose- to glucose- 
containing medium, but observed no change in growth rate over 
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Figure 3 GpilOp depletion causes a GPI-anchoring detect 

(A) The growth of the spores of one tetrad [RH3994 (A), RH3995 (8). RH3996 (C). RH3997 
(0)] was monitored upon shift from glycerol- to glucose-containing medium. (B) Incorporation 
of [^H]m>i3-inositol into proteins is shown for the spores of the tetrad. Cells were shifted from 
glycerol- to glucose-containing medium for 16 h. They were then labelled with [^H]/77>t>inositol. 
followed by preparation of a total cell tysate. analysis by SDS/PAGE and fluorography. (C) 
Pulse-chase experiments were performed in the spores of the tetrad. Cells were shifted from 
glycerol- to glucose-containing medium for 16 h. They were then metabolically labelled and 
chased as indicated. Lysates were prepared which were immunoprecipitated for Gaslp. The 
signal was detected by exposure and quantification of SDS gels using a Phosphorlmager. 
Quantification of these pulse-chase experiments are shown. The appearance of mature Gaslp 
indicates GPI-anchor attachment and transport to the Golgi apparatus. 



a period of 24 h (results not shown). This could be due to a very 
high expression of Gpilp on galactose medium combined with a 
long half-life of the protein. 

Since GPIIO appears to be the functional homologue oi PIG- 
B, we tested whether GPIIO is indeed involved in GPI-anchor 
synthesis. GPI-anchored proteins are the only proteins that 
incorporate }^\\]ptyoAnos\io\ due to the inositol residue in the 
GPI anchor. Cells were grown in SGYE to saturation and used 
to inoculate SDYE (5% glucose). After 16 h growth in SDYE 
(5 % glucose) the cells were labelled for 30 min with ^\\]myO' 
inositol, total protein was extracted and analysed by SDS/PAGE, 
followed by fluorography. We observed that in :: KanMX 

spores (B and D), the incorporation of radiolabelled inositol was 
greatly reduced when compared with the wild-type spores (A and 
C), demonstrating a defect in GPI-anchoring (Figure 3B), 
Furthermore, pulse-chase analysis of the GPI-anchored protein 
Gaslp was performed. Gaslp occurs as a 105 k Da form in the 
ER. Upon arrival at the Golgi, its core glycans are elongated, 
causing a shift in apparent molecular mass from 105 to 125 kDa. 
Transport of Gaslp from the ER to the Golgi depends on 
addition of the GPI anchor [30,31]. After 16 h of shift from 
SGYE medium to SDYE medium (5% glucose), cells were 
labelled for 5 min and chased for various amounts of lime. Cells 
were lysed and either Gaslp or the non-GPI-anchored vacuolar 
hydrolase carboxypeptidase Y was precipitated from total lysates 
using polyclonal antisera. Immune complexes were separated by 
SDS/PAGE and analysed on a Phosphorlmager. As shown in 
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Rgure 4 GpllOp depletion causes the accumulation of Manj-GlcN-CacyQPI 
and of a novel lipid 

pmMO (RH3998) or pmi40 gpilOh : ; KanMX (RH3999). both carrying pGal-GPIl 0^ were shifted 
to glucose-containing medium for 16 h. Cells were labelled lor 45 min with ('^C]mannose. 
foltowed by extraction of total lipids and analysis by TLC. Man^. Manj-6IcN-(acyOPI : MIPC, 
mannosylinositolphosphorylceramide; M(IP)2C, tnannosyldi-inositolphosphofylceramide: arrow, 
rwel lipid; CP, complete precursor: 0, origin. 



Figure 3(C), Gaslp maturation was severely delayed in 
g/7i70A:: KanMX spores (B and D) when compared with wild- 
lype spores (A and C). Under these conditions, maturation of 
carboxypeptidase Y was normal in al! four spores, showing that 
its transport to the vacuole was unaffected (results not shown). 
Therefore, the defect in maturation does not represent a general 
transport defect. These data are consistent with a role for GpilOp 
in GPI-anchor synthesis. 

Pig-B mutant cells are defective for the addition of the third 
mannose to the intermediate structure Man-(EthN-P)Man- 
GlcN-PI [32]. To test if GPIIO also mediates the addition of the 
third mannose to the GPI intermediate, we performed lipid 
labelling experiments in vivo. Wild-type yeast cells incorporate 
little exogenously added mannose. We therefore used cells 
carrying the pmi40 mutation, which is a temperature-sensitive 



allele of phosphomannose isomerase. Upon shift to non-per- 
missive temperature, these cells incorporate exogenously added 
mannose with high efficiency. The pmi40 mutation was intro- 
duced into a gp/y(?A:: KanMX strain containing pGal-GPIlO 
(RH3999). A pmi40 GPIIO strain derived from the same cross 
was used as control (RH3998). Cells were grown to saturation in 
YPG and shifted to SDCU medium (5 % glucose) for 16 h. Cells 
were preincubated for 10 min in the presence or absence of the 
GPI-synthesis inhibitor, YW3548, which causes the accumulation 
of Man,-GlcN-(acyl)PI [24]. After labelling for 45 min with 
["Cjmannose, total lipids were extracted and analysed by TLC 
(Figure 4). In wild-type cells, one can see some labelling of the 
complete GPI precursor and mannosylated sphingolipids. When 
the inhibitor YW3548 was added, a strong accumulation of a 
band comigrating with Manj-GlcN-(acyl)PI occurred [24). In 
Gpil Op-depleted cells, Man2-GlcN-(acyl)PI accumulated even in 
the absence of YW3548. In addition, a more polar lipid (indicated 
with an arrow in Figure 4) was strongly labelled, which was 
sensitive to treatment with GPI phospholipase D, but resistant to 
digestion with PI phospholipase C, suggesting that the lipid had 
an acylated inositol ring (results not shown). The appearance of 
this lipid is strongly diminished by YW3548. From the novel 
lipid and from the Man2-GIcN-(acyl)PI, hydrophilic fragments 
were prepared by nitrous acid deamination and borohydride 
reduction and were analysed by Bio-Gel P4 chromatography. 
Furthermore, neutral glycans were prepared by treatment with 
HF, followed by nitrous acid deamination and borohydride 
reduction, and were analysed by Dionex chromatography (Table 
2). In addition, the deaminated, reduced fragment derived from 
the novel lipid was subjected to treatment with Jack bean 
mannosidase followed by analysis by Dionex chromatography 
(Table 2). The novel lipid is sensitive to treatment with HF, 
contains the core glycan Man^-anhydromannitol and elutes from 
a Bio-Gel P4 sizing column at 7.3 glucose units (GU). On the 
same column, a deaminated, reduced fragment derived from T. 
brucei lipid C (EthN-P-Man^-GlcN-PI) eluted at 7.8 GU. Jack 
bean mannosidase treatment resulted in a shift in the elution 
profile from 7.3 to 7.0 GU and the release of a fragment eluting 
at 0.9 GU (mannose). These findings are consistent with a 
structure Man-(EthN-P)Man-GlcN-(acyI)-PL One additional 
ethanolamine phosphate leads to an apparent size increase of 
about 4 GU as previously demonstrated by comparing the elution 
position of EthN-P-Mang-GIcN-PI and Mang-GlcN-PI [28]. 

GPIIQ overexpression results In partial resistance to YW3548 

As YW3548 prevents the addition of the third mannose to the 
GPI intermediate Manj-GlcN-(acyl)PI and our data suggest that 



Table 2 Analysis of the lipid intermediates accumulated upon depletion of GpllOp 

RH3999 cells were metabolically labelled with I'^ClMan. and total lipids were extracted and analysed by TLC, The bands corresponding to Man^-GlcN-(acyl)PI, the novel lipid (Figure 4. arrow) 
and of lipid C [EthN-P-Man3-GlcN-(acyf)Pll from Trypanosoma brucei v/tte eluted from the TLC material by organic so^ents. Hydrophilic fragments were generated by nitrous acid deamination 
and reduction, wherease neutral core glycans were generated by dephosphorylation. deamination and reduction. Hydrophilic fragments and neutral core glycans were analysed by Bio-Gel P4 size- 
exclusion and Dionex high-pH anion-exchange chronratography (HPAEQ respectively. Partially hyrfrolysed glucose oligomers were used as internal standards. The elution position of the hydrophilic 
fragments and the neutral core glycans are indicated as glucose units (GU) and Dionex units (DU) respectively. J BAM. Jack bean a-mannosidase. 



BiO'Gel P4 Gel filtration analysis ol the hydrophilic fragments Dionex HPAEC analysis ol the neutral core glycans 

Lipid (GU) (OU) 

EthN-P-Man3-GlcN-(acyf)PI 7.8 2.5 

Man3-GlcN-(acyl)PI 3.3 2.1 

Novel lipid 7.3 2.1 

No\fel lipid (JBAI^) 7.0/0.9 1-1 
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Rgure 5 GPI10 overexpression confers partial resistance to the GPt- 
synthesis inhibitor YW3548 

(A) RH1 657 cells carrying pGal or pGal-GPtlO were grown in YGal and incubated with 2 ^g/ml 
YW3548 or methanol and growth was monilored by counting cells In a haemocytometer. (B) 
RH1657 cells carrying p6al or pGal-GPIlO were included in YGal plates and 4 /tl of ditlerent 
concentrations ol YW3548 were spotted on Ihe plates (top. 100 ^g/ml; middle, 10 ;tg/ml; and 
bottom, 1 /ig/ml) belore incubation at 30 **C. 



GFIIO is required for this step, we decided to test whether 
overexpression of GPIIO would lead to resistance to YW3548. 
To address this question, we transformed wild-type cells with 
either pGal-GPI 10 or the empty pGal control vector. The growth 
of cells in YGal medium in the presence of YW3548 or methanol 
as control was monitored (Figure 5A). We observed that cells 
carrying the control vector were unable to grow in the presence 
of YW3548, whereas cells that expressed GPfJO from the 
GALl/10 promoter were able to overcome the effects of the 
compound and grow, albeit more slowly, than untreated cells. In 
a different assay, cells carrying either pGal-GPIlO or pGal were 
included in YGal plates and halo assays were performed (Figure 
5B). Cells carrying the control vector gave a completely clear 
halo upon addition of YW3548. In contrast, overexpression of 
GPIW allowed partial growth in the presence of the compound. 
Whereas the halo was clear in the beginning, it became turbid 
after 2 days of incubation. This partial resistance suggests that 
YW3548 might interfere directly or indirectly with the function 
of GpilOp. 

DISCUSSION 

In this study, we present the characterization of the GPIJO gene 
from yeast. A deletion of the essential GPIJO gene can be 
complemented by expression of the human PIG-B cDNA, 
suggesting that GPIW and PIG-B are functional homologues. 
Both genes appear to mediate the same step in GPI-anchor 



synthesis. Previous studieis have indicated a high degree of 
homology between the mammalian and the yeast GPI-anchor 
synthesis machinery. The gpiS is mutant was shown to be 
complemented by its human homologue [21]. Also, mammalian 
class E mutants that are defective in Dol-P-Man synthase can be 
complemented by the homologous yeast gene [33]. In this study 
we show that a complete deletion of GPIJO can l>e complemented 
by its human homologue, the PIG-B gene. 

As expected from the sequence homology and the functional 
complementation test, GPIIO is involved in GPI-anchor syn- 
thesis. When GPIIO expression was repressed, radiolabelled 
inositol incorporation into proteins was greatly reduced and 
Gaslp maturation was strongly delayed, which is consistent with 
a GPI-syn thesis defect. All other genes that have t>een implicated 
in yeast GPI-anchor synthesis have also t>een found to be 
essential [18-22,34]. 

When GPIIO expression was turned off, Manj-GlcN-(acyl)PI 
as well as a novel lipid were found to accumulate. This 
lipid contained a Man,-anhydromannitol core glycan after 
dephosphorylation, nitrous acid treatment and reduction. Its 
hydrophilic fragment, however, eluted from a Bio-Gel P4 column 
at a much later position, which could be due to an ethanolamine 
phosphate attached to Man^-anhydromannitol. Furthermore, it 
is sensitive to treatment with Jack bean mannosidase. These 
findings suggest that the novel lipid may represent a GPI 
intermediate, most likely Man-(EthN-P)Man-GlcN-(acyI)PI. For 
the moment there is no formal proof that the aqueous HF- 
sensitive group on the mannose is ethanolamine phosphate. At 
this point different possibilities exist: it could represent an 
aberrant structure, which is only synthesized due to extensive 
pressure on the GPI-synthesis machinery imposed by the de- 
pletion of GpilOp, or it could represent a natural intermediate. 
Most GPI anchors isolated from mature yeast proteins do not 
seem to carry an ethanolamine phosphate modification on the 
first mannose [35]. However, in animal cells, GPI molecules have 
been identified with this modification [1], and the mammalian 
PIG-B enzyme most likely acts on a GPI precursor with the same 
carbohydrate structure that accumulated in this study. Previous 
studies, which characterized the complete, radiolabelled GPI 
precursor in yeast before it was added to the protein, would not 
necessarily have detected this modification with the techniques 
used [27]. 

Previously, we suggested that YW3548 blocked the addition of 
the third mannose to the GPI core because when cells or 
membranes were treated with the compound, a Man^-GPI 
intermediate with two mannoses and no ethanolamine accumu- 
lated. Here we have observed that the addition of a putative 
ethanolamine phosphate to the Man^-GPI intermediate was 
prevented by incubation of cells with YW3548. This raises the 
possibility that the inhibition of the addition of the third mannose 
to the GPI core is indirect and is due to the inhibition of 
ethanolamine phosphate addition to the first mannose by 
YW3548. Man-(EthN-P)Man-GIc-(acyl)PI could be the natural 
and highly preferred substrate for GpilOp, the putative a 1,2 
mannosyltransferase that adds the third mannose. In this case, 
the biosynthetic scheme in yeast would be PI -> GlcNAc-PI 
GlcN-PI -» GlcN-(acyl)Pl Man-GlcN-(acyl)PI Man-Man- 
GlcN-(acyl)PI -►Man-(ElhN-P)Man-GlcN-(acyl)PI -» Man- 
Man-(EthN-P)Man-GlcN-(acyl)PI (Man)Man-Man-(EthN- 
P)Man-GlcN-(acyl)PI EthN-P-(Man)Man-Man-(ElhN- 
P)Man-GlcN-(acyl)PI and YW3548 would block the addition of 
ethanolamine phosphate to the Man^ intermediate. Consistent 
with this synthetic scheme we have preliminary data on the 
structure of a GPI product that accumulates in a mutant that is 
defective in addition of the mannose side-chain. This product 
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also appears to have an ElhN-P on the first mannose of the core 
GPI structure (C. Siitterlin, P. Ceroid, R. Schwarz and H. 
Riezman, unpublished work). This newly proposed sequence of 
events in the yeast GPI synthesis pathway would also be 
consistent with the lack of activity of YW3548 on ail protozoa 
tested [24], because these cells do not use this modification on the 
first mannose [1]. Previously, we suggested that YW3548 might 
mimic the Manj-GlcN-(acyI)PI substrate and compete for bind- 
ing of this substrate to the a-1,2 mannosyltransferase [24]. This 
same argument would hold for the phosphoethanolamine trans- 
ferase needed to generate this novel yeast lipid. 

Overexpression of GFJW can confer partial resistance to 
YW3548, as tested by growth in the presence of YW3548. The 
partial resistance could be explained if GpilOp were one of the 
targets of YW3548 or if the real target is the phospho- 
ethanolamine transferase that works on the first mannose. In the 
latter case, GpilOp may utilize the unmodified Man,-GPI only 
poorly as a substrate. Although no mannosyltransferase activity 
of GpilOp and the Pig-B protein have been shown, both proteins 
share significant homology with Alg9p (18% identity, 31% 
similarity). Alg9p is involved in the transfer of the seventh 
mannose in N-glycan precursor synthesis [36], The mannose in 
that case is also donated by Dol-P-Man and is attached in an a- 
1,2 linkage. It is therefore likely that Pig-B, GpilOp and Alg9p 
represent a new family of Dol-P-Man-requiring a-1,2 mannosyl- 
transferases. 

The identification of GPIIO as a gene involved in an in- 
termediate step of GPI -anchor synthesis improves the charac- 
terization of the yeast GPI-synthesis machinery. Functional 
complementation could also allow the cloning of the protozoan 
enzymes which, at least in their native environments, have been 
shown not to be sensitive to YW3548 [24]. Expression of these 
genes in yeast may make it possible to screen for compounds that 
specifically interfere with the addition of the third mannose 
residue in fungi or protozoa. 
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Saccharontyces cerevisiae Cdc7 kinase is essential for 
initiation of DNA replication, and Hslcl, a related 
kinase of Schitosaccharomyces pombCj is also required 
for DNA replication of fission yeast cells. We report here 
cDNAs encoding Cdc7-related kinases from human 
and Xenopus (huCdc7 and xeCdc7, respectively). The 
cloned cDNA for huCdc7 contains an open reading 
frame consisting of 574 amino acids with a predicted 
molecular weight of 63 847 that possesses overall amino 
acid identity of 32% (54% including similar residues) 
to Cdc7 and Hskl. huCDC7 is transcribed in the 
various tissues examined, but most abundantly in testis. 
Three transcripts of 4.4, 3.5 and 2.4 kb in length are 
detected. The 3.5 kb transcript is the most predominant 
and is expressed in all the tissues examined. A cDNA 
containing a 91 nucleotide insertion at the N-terminal 
region of huCDC7 is also detected, suggesting the 
presence of multiple splicing variants. The huCdc7 
protein is expressed at a constant level during the 
mitotic cell cycle and is localized primarily in nuclei 
in interphase and distributed diffusibly in cytoplasm 
in the mitotic phase. The wild-type huCdc7 protein 
expressed in COS7 cells phosphorylates MCM2 and 
MCM3 proteins in vitroy suggesting that huCdc7 may 
regulate processes of DNA replication by modulating 
MCM functions. 

Keywords: CDC7/celI cycIe/DNA replication/ MCM 
proteins/serine-threonine kinase 



Introduction 

initiation of chromosomal replication enVnrvnie*: is 
tiphtlv regulated during the cell cycle oenetic'and uio- 
chemical studies in trie yeast :>accnarbmyces cerevisiae 
have yielded considerable information on molecular inter- 
actions between replication machinery and cell cycle 
regulators (Coverley and Laskey, 1994; Huberman, 1995; 
Kearsey et a/., 1996). The replication origins of Sxere- 
visiae, which are composed of the conserved 11 bp *A' 
or core sequence and auxiliary *B* elements, exist as 
nucleoprotein structures which involve origin recognition 
complex (ORC). MCM, Cdc6 and probably other proteins 
(Campbell and Newlon, 1991; Walker et a/.. 1991; Bell 
and Stillman, 1992; Diffley and Cocker. 1992; Marahrens 
and Stillman. 1992; Bell et a/., 1993; Diffley et al, 1994; 
Rao et aL, 1994; Rowley et ai, 1994. 1995; Theis and 
Newlon, 1994; Fox et ai. 1995; Liang et al, 1995; Loo 



et al, 1995; Rao and Stillman, 1995). It was reported 
previously that these protein-DNA complexes may altern- 
ate between two distinct states during the cell cycle; one 
that exists prior to the S phase and the other that is 
detected during the G2 and M phases. The former pre- 
replicative complex contains ORC and Cdc6 as well as 
MCM licensing factor, while the latter post-replicative 
complex may contain only ORC (Liang et ai, 1995; 
Cocker et al, 1996; Donovan and Diffley, 1996). The 
prjc-replicative complex may be activated by regulatory 
molecules which trigger the initiation of S phase. Genetic 
study of S.cerevisiae has implicated serine-threonine 
kinases in this step, among which the Cdc7-Dbf4 kinase 
complex may turn on the ultimate 'START' signal for the 
S phase (Kitada et al, 1992; Jackson et al, 1993; Dowell 
et al, 1994; Sclafani and Jackson. 1994; Bell, 1995). In 
the presence of active Cdc7 kinase, the S phase can be 
completed in the absence of protein synthesis (Hartwell, 
1974). Cdc7, whose kinase activity peaks at the G|/S 
boundary, activates DNA replication machinery in con- 
junction with Dbf4, which not only stimulates its kinase 
activity but may also tether Cdc7 at the origins of 
replication (Jackson et al, 1993; Yoon etai, 1993; Dowell 
et al, 1994). 

The structures of DNA replication origins and modes 
of their activation have been elusive in higher eukaryotes 
(Hamlin and Dijkwel, 1995). However, identification of 
genes related to ORC and MCM components in Xenopus, 
Drosophila and mammals has strongly indicated that the 
basic components required for initiation of chromosomal 
replication may be conserved in higher eukaryotes 
(TTiommes et al, 1992; Hu et aL, 1993; Chong et a/.. 
1995; Ehrenhofer-Murray et al, 1995; Gavin et a/.. 1995; 
Gossen et al, 1995; Kimura et al, 1995; Kubota et al, 
1995; Treisman et al, 1995; Carpenter et a/.. 1996). We 
previously reported hskl^, whose product is a putative 
Schizosaccharomyces pombe homologue of Cdc7 kinase 
(Masai er a/., 1995). hskl^ is essential for viability 
of S.pdmbe cells, and analyses of DNA content and 
morphology of germinating spores containing hskJ null 
alleles indicated that hskl'^ is required for DNA replication 
as well as for coupling of the M phase to S phase initiation. 
The presence of the structurally and functionally related 
kinases in two distantly related yeast species suggested 
the possibility that eukaryolic DNA replication may be 
regulated through a conserved mechanism which involves 
Cdc7-related kinases. 

' We report here isolation of human and Xenopus cDNAs 
encoding Cdc7-related kinases (huCdc7 and xeCdc7, 
respectively). We show that huCdc7 is a nuclear protein 
kinase expressed at a constant level throughout the cell 
cycle. We also report that huCdc7 expressed in C0S7 cells 
phosphorylates MCM components in vitro, suggesting 
possible regulation of MCM functions by Cdc7-related 
kinase. 
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Fig. 1. Strategy for PCR cloning of CDC7 -related kinases and cloned cDNAs. Two sets of degenerate primers indicated by open arrowheads were 
designed on the basis of the conserved amino acid sequences between the products of S.cerevisiae <Sc,) CDC7 and S.pombe (Sp.) hskl*. Shaded and 
striped regions represent conserved kinase domains and kinase insert sequences, respectively, whereas open areas indicate N- and C-terminal tail 
sequences. A, B, E, K and X represent the sites for AgeJ, BcH, EcoRl, Kasl and Xbah respectively. HA indicates a haemagglulinin epitope sequence. 
The thick bais indicate the segments present in the isolated cDNAs or those subcloned into vectors. Details are described in the text. 



Results 

Isolation of human and Xenopus cDNAs related to 
Sxerevisiae CDC7 and S.pombe hskl^ 

In order to isolate CDC7/hskl ^-Tcizicd genes from higher 
eukaryoles, we performed RT-PCR on mRN A isolated from 
murine embryonic stem (ES) cells, using degenerate oligon- 
ucleotide probes designed on the basis of amino acid 
sequence homology between the products of CDC7 and 
hskl^. A PCR product -420 bp long, generated with a 
set of the primers derived from the conserved amino acid 
sequences of kinase subdomains I and VII, was isolated and 
used as a template for the second nested PCR with primers 



derived from the conserved domains I and VI. The second 
PCR gave rise to one major band of expected size (-350 
bp). Therefore, the 420 bp long PCR product from the first 
PCR was subcloned into the T vector (M archuk et a/. . 1 99 1 ) 
and its sequence was determined. 

Sequencing of this DNA fragment revealed the presence 
of a reading frame that showed substantial homology (52 
identical amino acids from 138 residues) to both Cdc7 
and Hskl. Screening of human and Xenopus cDNA 
libraries using this DNA fragment as a probe led to 
isolation of three human and three Xenopus clones, and 
the inserts of these positive clones were subcloned into 
KS vectors for further characterization (Figure 1). 
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The restriction mapping of the 1.8 and 2.9 kb inserts 
from the human clones #8 and #13 (both obtained from 
a fetal liver library), respectively, indicated the presence 
of an overlapping region between the two inserts. The 
nucleotide sequence of 1756 bp long #8 cDNA revealed 
the presence of a 334 bp 5'-unlransIated region and a 
1422 bp stretch of a coding region. The clone #13 cDNA 
did not contain 5' -untranslated region, but possessed a 
1416 bp stretch of a coding region and a 1334 bp 
3'-untranslaled region together with poly(A) sequences. 
Although a 535 amino acid long coding region could be 
deduced by combining #8 and #13 cDNAs (#6), a start 
codon ATG was not found upstream of the kinase sub- 
domain I. The N-terminal region of clone #8 was identical 
to the portion of the sequence of #1 (obtained from a 
testis library), except that #1 cDNA had a 91 bp deletion. 
This 91 bp deletion would give rise to an open reading 
frame (ORF) that contains three possible ATG codons, 
among which the first (from the 5' end) is used to initiate 
translation of huCdc7 protein (see below). The full-length 
huCDCT cDNA contains a 1722 bp long ORF, encoding 
a 574 amino acid long protein with a predicted mol. wt 
of 63 847 (Figure 2). 

Xenopus clones #25 and #28, obtained from a Xenopus 
oocyte library, were identical and the nucleotide sequence 
of the 1467 bp insert of #25 cDNA showed a 1425 bp 
long ORF frame and a 42 bp 3 '-untranslated region. 
Clone #23, obtained from another Xenopus oocyte library, 
contained an additional eight amino acids at the N-terminus 
including the putative ATG initiation codon, and a 1449 bp 
long ORF, encoding a 483 amino acid protein with a 
predicted mol. wt of 53 509. was deduced by combining 
the three clones. 

Primary structures of CdcJ-related kinases from 
iiigher eukaryotes 

The predicted primary structures of human and Xenopus 
Cdc7-related kinase indicate that their kinase domains are 
highly homologous to those of Cdc7 and Hskl (Figure 
3A), When confined to the kinase conserved domains, 
44% identity at the amino acid level (62% including 
sinular residues) is detected between Cdc7 and huCdc7 
or xeCdc7, whereas 80% identity (90% including similar 
residues) is detected between human and frog. Amino 
acid sequence alignments of the kinase domains of Cdc7- 
related kinases were calculated together with other repre- 
sentative serine/threonine kinases, using the ClustalW 
program, and a hypothetical phylogenetic tree was drawn. 
In the phylogenetic tree. Cdc7-related kinases were classi- 
fied into a subfamily distinct from other kinases, including 
CDK, CKII or MAPK (Figure 3B), indicating that they 
are members of a unique *Cdc7' kinase family. 

Cdc7 and Hsk 1 are characterized by the presence of three 
*kinase-insert* sequences between the kinase domains 1 
and II, VII and VIII, and X and XI, designated as kinase 
insert I. II and III, respectively. huCdc7 and xeCdc7 also 
contain two amino acid insertions at the same locations 
(corresponding to kinase inserts II and III), although the 
presence of kinase insert I was not obvious in the human 
and frog clones. The lengths and sequences of the kinase 
inserts are not conserved between yeasts and higher 
eukaryotes as they are not between the two yeast species, 
although weak homology is identified between the human 
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Ul t 



I 



4*« 



II CCC CAT CCT TCT CAT CAA OCA CCT ATT TCA CAC AAC ACT CAC CAT AAA CCT TCT TCC CTC ISO* 



401 4 



lS«t CTTCAAACACCTCCACCACAATACTCXOCCAATTCATTrAAAAACCCCCATACTAATAOC !»•• 

»iv«TppcoTj«»«ricBcos»a si* 

t3«l TSTCACCATTCTTTTCATOCWIAAtMXAATTTACAACeCTCCAATCAeCIkCCTCAT U1* 

511CSIICrO«TPT«tlCll»I»»0 »4« 

1(31 CAACCTTATCACCTCCITCATAAACTTClXCATCTAAATCCAOCTTCfcACAATXACfcCC* KM 

9ttKATDLI,DKLtetM»*SKITA M 



IIOI CAA CAA OCT TTC TtC CAT CCA TTT TTT AAA CAT ATS ACC TTC TCA TA ATC CAT CTT CAT 

5*1 KCALtnrpPKeiitL- 



1T4« TrAATCTTrACTCTTATCAOCT>eAATlUU>AAJ>CAJttll CIIfClA AlACCpyCAA t>IlCllUIllA CACA< 

laiO ATTAATAAT TOT T H A ACATTTP 4 H. IT f W HXCA CATTCtAAAATATACArrAACAATACTTAAAATOCCTOCCAT !•♦» 

1900 ACTTCTTCOCACTAAOkACATC AM ' H. 1 1 l<.A CTTAAACCT»>OCTX>CT>CATTTP> a. VOCC IH- i H A TTAOCTCACA Ijn 

1)00 TrrmCCTl C CCTlUtTDtfCTTTC>CICACAlAt»CACAAAAAOCACC>CTrT»CTTTTyTIAATT»AAATTAACAC 1039 

lOtO ATCTCATCACCATT*AATCAATCAAAACACttAATTTCTACATtt f H rACACHATCAOCtACCTATAL rmX<XAAA JUt 
CICAJ CV im. m. lm-i<.4> ». I lA ACAATTTTCtAAATXAACAACAtAA l 1 lU. U IB. lA OCCtACATATTAOCCCTT >11» 

131* TTATCAACACTAAAACAATe»CaUOtT C TTOCT O tTOOOCCAAA C TATC*CTTAAAATTCAATTCATCCATTTTTAAAAA 3trf 

13M AO^CITCATCAAAOCAT niTOCTCTQAATfUCCATTI'l'l I W. I IM. HXX I ICTCAAI IIJV 1 HH. I K K tOCOXOC 13Tt 

I3M C OtAAfMOM P L r .M^ ■ A AA^ TT*OT W TP'r" ; *r r*'^ * H.1IHA11A AATAITCTK4X11.II1ACT 343* 

1440 CTTOCCAAlTTKAAATAAT»CATAC*CTOCTTCAT10UkOCCAA«TSCAOCACCAACCTC 133« 

33t« I H H. 1 li 1 1 L CAgTTCTATATTTTTOCAOXJCATT. TTACATXT»ATTACyT>CCrXTCA;>ACCAAAACTCAATACACA 341* 

HID ATTCA t.\)CTA T lA tTCCACATTT rT VLTtICC C lA CA 4XCAnXA CATCTVTtfTA B.\. ' i<.l I I ' lC ACTAACH, I lACCTC !(,« 

)-,«« rmrrT'' * »• ^'-'J-* " »p- »iTTt? T»m-* » H > trr WW AAfcATAOOgtmLtATTeACAAAIOCTATATAT 3T1« 

]la« CAATCAOUgATATCAAACTTCCTATOCCAfcAAACTCTO C T C CCTOCTCAOCTCACAeATTrCCCATTr^CTACTCATXS li3« 

1M» AATAOCAAATACTTTAOOCACAlVtA t TCA 1 1 Fit. 1 lAl f I ILACCATICAXACCICAClATXTCTACCTAAttTl. TTT 3»3» 

194B CCTAACTATPmCCATAT^TAAACCATPCCATICAT V W. H. 11 a lUCCA TAATCTT^AAAAAAAATrCA AlC CTCT lCA AT 3»1» 

3010 TTCTATATt CAATAAA? TTAT^ 



Fig. 2. Nucleotide sequences of a full-length huCDCT cDNA and 
predicted amino acid sequences. The A residue of the putative first 
methionine codon is taken as + 1 . The position and nucleotide 
sequence of the 91 bp insertion is also shown. The poly(A) addition 
signal is double-underlined. 
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1 huCdcT MEASLCIOMDEPMAF SPQRDRFOKECSLKK HEQUFKl* 

2 z«Cdc7 MSS CD-N _ 

3 ScCdc? MTSKTKH IDDIP PEIKEEKIQLYB DlJCIEKEtTC^D^^^lTSi^jn^ 

4 SpCdc? -VbvEABITLSPKVTH EQQTDIDSECEirEV DDENVHEHKSQEMIQ DIPARDREEIEMITR TFVELQEMT!@1^^^^^ 



■AGVKKDIEKLYE AVPQI^NVTigfEpKlfiro 
SGVAKEIEKLYA AWQUmi^H^^^^^^^i^^j^^ 



79 
48 
54 
89 



Xlna»a Insert 1 

1 buCdc7 0 VC 

2 x«Cdc7 R SC 



<II> 



<III> 



<V> 



A SpCdc7 YCRTflKOHOIQSEVL KESSFCKEKIPVNED SKraKtida^tiO^li^T^f 

* *.. ********. * . . * *.*. .** * 



3 ScCdc7 C . IdTKKTASHFW NXftSH 



140 

109 
126 
179 



<VI> <VII> )clnaa« insert 2 

1 hucdc? semj:$^s^^vmmm^^ seaqqercsqnkshi 

2 x©Cdc7 fcFAb'iL^tSmtki-Kbfi^^^ 

4 SpCdc7 oiBQTTmS^^ 



230 
184 
201 
241 



1 buCdcT ITCHKIPLSGPVPKB LDQQSTTKASVKKPT TMAQIQIKOGKDCKB CSVCLSVQRSVFGER MFNIHSSISHESPAV KLMKQSKTVDVLSRK 320 

2 xeCdc7 PKK QD CLVCSSTQRSVFCER imiVHSAVTIDNTTL KAAKPSKTIDVTTRK 234 

3 ScCdc7 TD HTANTNHDCCYSMR H HEQFCPCIMRH QYSPMSHHQTPPMVT 244 

4 SpCdc7 - — TD -CRQ Q— SHSCRCTHSHAX ELAHDFSIAQ 269 



<VHI> <IX> 

1 buCdc7 

2 xeCdc7 

3 ScCdc7 IQNG-KWHLNWVNC VDLTKC YPKNE T WlIKR>;(m^ii!:^?nPi^ 

4 SpCdc7 ETSLC TIKND T WSKRMWACTOta^j^^^^ 



410 

323 
315 
325 



•*«* *** * ** 



<X> kinase insert 3 

1 huCdc7 RYPFTI?R5PT)j;Taa^^^ KTPGKSII>CSKBVP AQDLRK LCERLRC K DSSTPKLTSDIQGHA 

2 xeCdc7 K&^^MW&^^^^^^^WlQ^S KCFCKSVLCSKBLP SKDLRT LCECl-RSAIVLPMCH QHDIQKQRAALQMRI 

3 ScCdc7 ^^■^^sWMaf^^iSJ^^GV^^KCX ALHC UZFEXSG 1.IWDKPNGTSNCLKE FVYDLLHKECT ICTFPEYSVAFETFC 

4 SpCdc7 OTKEsi^W^M¥ii«^^KSftQCA ALHC CTFBTN VSTLTEKRVHFR 



483 
403 
397 
377 



<XI> 

1 buCdc7 SHOPAISEKTDHKAS CLVQTPPCQTSCNSF KKGDSNSCEBCFDET -NTHIXG*niEyPpEAi:;?)roiJJ)K3a»EK^ 

2 xeCdc7 KENQ DCWFLPESPDITPDS PAWRSSCVSTSDNM ZQSKimGmiySl^MWBF^^^^WM^^ 

3 ScCdc7 FI>Q CELHDRMSIEPQL PDPKTKMDAVDATEL KKTQEEIWS&|HititfC|;j^O^™ 

4 SpCdc7 -r KLILRASCCSASITX EiaJ^HKPSQ£|^liel£bS^&i£DO^ 



572 
482 
472 
436 



1 lioCdc7 SL r 574 

2 xeCdc7 R <83 

3 ScCdc7 NENTTLLD — CES TDEDD-- WSSSEADLL DKDVLLISE 507 

4 SpCdc7 NIATEKKDODTAFDM SFCETSTEKDEOLTA KHLSHILDFKEQEET DKPTSLSKRKRSIOE ZLPMDAI>QDGA 507 



B 



- ScStc? 

huMAPKKS 



- huMek-1 

— huMek-2 

ScCdcS 

huPfk 



— xeCdc7 
-ScCdcT 



-SpCdc7 



- ScCdcSe 



-huCdlt2 
— SpCdc2 
huErkI 
huEr1c2 
ScFus3 



- YpkA 



Fig. 3. Comparison of CDC7-related kinases from various organisms. 

(A) The predicted amino acid sequences of four CDC7-related kinases 
are aligned for maximum homology. Amino acid numbers are shown 
on the left. The roman figures above the sequences indicate the 
locations of the conserved kinase domains as previously proposed 
(Hanks et al, 1988). which are boxed in grey. The asterisks 
indicate the amino acid residues conserved in all four proteins. 

(B) A hypothetical phylogenetic tree. Amino acid sequence alignments 
were calculated using the QusialW program, and a phylogenetic tree 
was drawn by the SINCA program (Zuckerlandl and Pauling and 
neighbour-joining methods). huCdc7, xeCdc7, ScCdc7 and SpCdc7 
indicate the Cdc7r related kinases from human, Xenopus, S.cerevisiae 
and S.pombe, respectively. COT, cot oncogene product; YpkA, 
enterobacterial protein kinase. 



and Xenopus proteins. The kinase insert II of huCdc7 or 
xeCdc7 is 163 or 108 amino acid long, respectively, and 
57 amino acids are identical, with 1 1 additional similar 
residues. The kinase insert III of huCdc7 or xeCdc7 is 98 



or 95 amino acids long, respectively, of which 34 amino 
acids are identical and 14 additional amino acids are 
similar. 

Another feature of Cdc7 and Hskl is the presence of 
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Fig. 4. Genomic Southern analysis of C£)C7-re!ated genes in various 
species. Genomic DNAs were digested with EcoRL Membrane was 
hybridized using huCDC7 cDNA (probe A» nucleotide residues -132 
to + 1535; see the legend to Figure 5) as a probe, and washed in 
0.1 X SSC and 0.1% SDS at 50°C, 



the C-terminal regions which are rich in acidic residues. 
Two-hybrid assays indicated that Dbf4 protein could 
interact with this C-terminal tail of Cdc7 protein in 
budding yeast (Jackson et aL, 1993). The C-terminal 
regions of both Cdc7 and Hskl are essential for the 
functions of these two kinases (our unpublished data). 
Unexpectedly, huCdc7 and xeCdc? did not contain similar 
C-terminal tails (see discussion below). 

Genomic Southern and Northern analysis 
Southern analyses of genomic DNA from various eukary- 
otic species using two EcoRl fragments (0.7 and 1.1 kb) 
derived from huCE>C7 cDNA containing the amino acids 
1-511 as a probe (probe A) indicated that they hybridized 
with DNA fragments of human, monkey, rat. mouse, 
dog, cow, rabbit and chicken under a stringent washing 
condition (O.lx SSC and 0.1% SDS at 50X), but that 
they did not hybridize with those of budding yeast even 
under a relaxed washing condition (2X SSC and 0.1% 
SDS at 42°C) (Figure 4). 

Northern analyses of mRNA from various tissues 
(Figure 5) showed that huCDC7 transcripts, which were 
detected in most tissues examined, are expressed at a high 
level in testis and at a moderate level in thymus, spleen, 
placenta, brain and heart. Three transcripts of 4.4, 3.5 and 
2.4 kb in length were detected, among which the 3.5 kb 
transcript was ubiquitously detected. The 2.4 kb transcript 
was seen only in testis, and did not hybridize with a probe 
specific to the C-terminal region of huCDC7 cDNA 
(Kpnl~Xba\ fragment containing the amino acids 538- 
574; probe B), indicating that it is an alternatively spliced 
form lacking the C-terminai coding region. Transcription 




huCt>C7 



prob* A prob* B 

Fig. 5. Expression of huCDO mRNA in various human tissues. 
Membranes were hybridized with probe A (upper) or probe B (lower). 
Probes A and B include the huCDC7 cDNA fragment from nucleotide 
residue -132 to +1535 and thai from +1611 to +2202, respectively. 
The shaded regions indicate the kinase insert sequences. 

of a mouse Cdc7 -related kinase also exhibited similar 
tissue distribution, with the highest expression in testis 
(our unpublished data). 

The 3.5 kb transcript was predominant and ubiquitously 
detected in various human cell lines such as HL60, K562, 
M0LT4, Raji. SW480 and HeLa (S3) (data not shown). 
The level of huCDC7 transcription was similar between 
these cancer cell lines. The 4.4 kb transcript was also 
detected in most cell lines, albeit at a much lower level. 

Identification of endogenous huCdc? protein 

Endogenous huCdc7 was detected by specific antibodies, 
which were raised against GST fusion proteins containing 
segments of huCdc7 protein. The mouse monoclonal 
antibody 4A8 specifically recognized a single protein 
which was expressed by the huCdc7-expressing plasmid 
in COS7 cells and migrated with an apparent mol. wt of 
68 kDa (Figure 6 A). huCdc7 protein was inmiunoprecipit- 
ated by the 4A8 antibody from the cell lysates prepared 
from a factor-dependent myeloid leukaemia cell line. TF-1 . 
and was identified by Western blot analysis with the rabbit 
polyclonal antibody #1 (Figure 6B). In order to determine 
which of the three possible ATG codons is utilized to 
initiate translation of the huCdc7 coding frame, we 
expressed both long and short forms of huCdc7 (pKU- 
long-huCdc7 and pKU-short-huCdc7 initiated from the 
first and third ATG. respectively; Figure 1) and compared 
their migration on a gel with that of the endogenous 
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Fig. 6. Expression of huCdc7 protein in tissue culture cells. 
(A) Specificity of the monoclonal antibody, 4A8. Nuclear extracts 
prepared from COS7 cells transfecied with pKU-long-huCdc7 (lane 1) 
or with the vector (lane 2) were separated on SDS-PAGE and blotted 
in 1 ng/ml monoclonal antibody 4A8. (B) Expression of huCdcT 
protein at different cell cycle stages. The whole cell lysates were 
prepared from 10' TF-I cells or from 1.5X10^ C0S7 cells which had 
been transfecied with I jig of long or short fonn of huCDC7 
expression vector and had been harvested 2 days after uansfection. 
huCdc7 protein was inununoprecipiated with the monoclonal antibody 
4A8, (Q Salt and DNase I extraction of nuclear huCdc7 protein. K562 
cells were extracted with hypotonic buffer and nuclei were extracted 
with buffer containing DNase I or NaCI at the concentrations 
indicated. The supernatant (S) and pellet (P) were separated by 
centrifugation. Immunoprecipitates (B) and protein fractions (Q were 
separated on an 8% SDS-PAGE, transferred onto Immobilon paper 
and were blotted with anti-huCdc7 polyclonal antibody #1. 



prolei''. ' Hie.endoge nous huCdc7 nroiein co-mierated with 
^ longJoxini.J"^*ica"rmg Itlartne hrsi ATG initiates 
translation of huCSc . *XFIfeure 6B). 

The amount of huCdc7 stayed relatively constant at 
various phases of the mitotic cell cycle in TF-1 cells 
synchronized by mimosine or nocodazole or after depletion 
of a growth factor, human granulocyte-macrophage 
colony-stimulating factor (GM-CSF) (Figure 6B). 

Nuclear localization of huCdc7 
huCdc7 is localized mainly in nuclei in the interphase and 
is present diffusibly in the cytoplasm in the mitotic 
phase of HeLa cells, as indicated by immunofluorescence 
analyses using the monoclonal antibody 4A8 (data not 
shown). In order to examine whether the nuclear localiz- 
ation of huCdc? changes at the Gi/S transition, HeLa 
cells were synchronized at the Gj/S boundary by double 
thymidine block and the S phase cells were obtained at 
4 h after the release into the cell cycle. In this experiment, 
soluble proteins and phospholipids were extracted with 
Triton X-100 before fixation (Fey et al, 1984). Cells 
were stained by anti-bromodeoxyuridine (BrdU), anti- 
proliferating cell nuclear antigen (PCNA). anti-mouse 
CDC21 (MCM4) and anti-huCdc7 (4A8) antibodies. BrdU 
was incorporated into 100% of the S phase cells, indicating 
thiat DNA was being synthesized synchronously (Figure 
7B). PCNA accumulated in nuclei as the S phase pro- 
gressed (Figure 7D). On the other hand, MCM protein was 
localized in nuclei before DNA synthesis, and disappeared 
from nuclei during the S phase, as expected from its 
licensing function required for *once and only once* 
replication in the S phase (Figure 7E and F) (Blow and 
Laskey. 1988; Blow. 1993; Kimura et aL, 1994; Chong 
et al, 1995; Kubota et al, 1995). huCdc7 was found to 
be localized in nuclei from the Gi through the S phase, 
and no obvious relocation of huCdc7 upon progression 
into the S phase was detected (Figure 7G and H). More 
than 50% of the nuclear huCdc7 protein was extracted in 
buffer containing 0.5 M NaCI, but remained in the pellet 
after digestion with 2 mg/ml DNase I (Figure 6C)- 
Similarly, MCM3 was extracted by salt but not by DNase 
I, as previously reported (data not shown; Kimura et al, 
1994). Therefore, it is likely that the majority of huCdc7 
prepared from a random culture binds to some nuclear 
structures rather than to chromatin. 

huCdcl phosphorylates MCM proteins 

Studies on yeast and Xenopus DNA replication indicated 
that eukaryolic replication origins may be associated with 
ORG and MCM protein complexes, and functions of these 
origin-associated proteins may be regulated in a cell cycle- 
specific manner (Blow, 1993; Yan et a/., 1993; Kimura 
et a/.. 1994; Chong et aL, 1995; Todorov et aL, 1995; 
Coue et al.y 1996). Therefore, these origin-associated 
proteins could be the targets of phosphorylation events 
essential for Gj lo S transition (Bell, 1995; Carpenter 
et al, 1996; Leatherwood et ai, 1996). The Cdc7-Dbf4 
kinase complex in budding yeast is likely lo be bound at 
the origins due lo the origin-binding activity of Dbf4 
protein (Dowell et al, 1994). suggesting the possibility that 
ORC and MCM may be phosphorylated by Cdc7 kinase. 

We examined whether MCM components can be phos- 
phorylated by huCdc7 kinase. Extracts were prepared from 
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Fig. 7. Nuclear localization of huCdc? does not alter before and after G|/S transition. HeLa cells weie synchronized at the G,/S boundary by double 
thymidine block (A. C. E and G) and then released for 4 h to proceed into the S phase (B, D. F and H). (A) and (B) are superimposed images of 
propidium iodide staining of DNA (red) and BrdU staining (green). PCNA (C and D), MCM4 (E and F) and huCdc? (G and H) are immunostained 
with specific antibodies. Magnification; 200X (A and B), 400 x (C-H). 



C0S7 cells transfecled with a vector, haemagglutinin 
(HA)-tagged wild-type huCDC7-expressing plasmid, or 
HA-tagged K90R kinase-negalive huCDC7-ex pressing 
plasmid. Roughly equal amounts of transiently expressed 
huCdc7 protein were immunoprecipitated from the wild- 
type and K90R Iransfeclants by anti-HA antibody, while 
huCdc7 was not detected in the immunoprecipilate from 
vector-transfected C0S7 cells (Figure 8C). Upon incub- 
ation of the immunoprecipitales with (y-^^PlATP in the 



presence of purified GST fusion protein containing 
Xenopus MCM2N (amino acid residues 1-559) or human 
MCM3 (amino acid residues 1-808), the MCM proteins 
were phosphorylated efficiently by the wild-type huCdc7 
immunoprecipitate, while the level of phosphorylation by 
K90R huCdc7 was no more higher than the vector control 
(Figure 8A and B). The results indicate that MCM2 and 
MCM 3 proteins can be phosphorylated by huCdc7 in vitro 
and suggest the possibility that functions of the MCM 
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Fig. 8. In vitro phosphorylation of GST-MCM proteins by huCdc7 
kinase. C0S7 cells were transfected with 1 of either pKU3 vector 
(lane 1), pKU-HA-iagged WT huCDC7 (wild-type, lane 2) or 
pKU-HA-tagged K90R huCDC7 (kinase- negative fonn, lane 3). Two 
days after the transfeclion, cells were lysed and transiently expressed 
huCdc7 protein was immunoprecipitated with anti-HA antibody. 
(A) The immunoprecipitates were incubated with [y-^^PJATP in the 
presence of cither GST-fused Xenopus MCM2N (aniino acid residues 
1-559) or GST-fiised human MCM3 (amino acid residues 1-808). The 
reaction mixmres were resolved by 10% SDS-PAGE, and the gel was 
dried and auioradiographed (B) ^^P incorporation into the MCM 
protein bands was quantified by Image Analyzer (Fuji Film) and the 
values relative to the vector control, which is taken as 1. are shown. 
(C) Immunoprecipitates used for the kinase assays were blotted with 
the anti-huCdc7 polyclonal antibody #1. 

complex are regulated by phosphorylation by the Cdc7- 
related kinase. 

Discussion 

Presence of CDCl-related kinases in higher 
eukaryotes 

Replication of eukaryotic cells is controlled precisely 
during the cell cycle. In spite of apparent diversity in 
structures of replication origins in higher eukaryotes in 
comparison with those of a lower eukaryote such as 
S.cerevisiae, there appears to be striking conservation in 
proteins required for the processes of assembly of replic- 
ation machinery (Gavin et al, 1995; Hamlin and Dijkwel, 
1995; Donovan and Diffley, 1996). Proteins related to 
components of ORC, a protein complex bound specifically 
to the budding yeast replication origins, were discovered 
in Drosophila^ Xenopus and human. Furthermore, they 
also form a multi-protein complex containing protein 
components similar to those of budding yeast (Gavin 
et fl/., 1995; Gossen et al, 1995; Carpenter et al, 1996). 
MCM proteins, originally discovered in Sxerevisiae, were 
identified as components for the licensing factor essential 
for DNA replication in Xenopus egg extracts (Chong et ai, 
1995; Kubota et al, 1995). Proteins related to all the six 
MCM components have been identified in human as well 
(Hu et aL, 1993). Thus, basic mechanisms of initiation of 
chromosomal replication as well as its regulation may be 
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conserved from yeasts to human. We previously reported 
hskJ*, a putative homologue of CDC7 from a distantly 
related yeast, S.pombe^ and suggested the possibility that 
the S phase initiation in eukaryotes may be regulated in 
a conserved manner involving Cdc7-related kinases (Masai 
et al, 1995). Isolation of Cdc7-related kinases from human 
and Xenopus, reported in this study, further strengthens 
our proposal that Cdc7-relaied kinases are the key regu- 
lators for initiation of DNA replication conserved in 
eukaryotes. 

The Cdc7-related kinases from higher eukaryotes share 
structural similarity to the yeast counterparts, exhibiting 
42-44% identity in the conserved domains for serine- 
threonine kinases in addition to the presence of two kinase 
insert sequences at the conserved locations (Figure 3A). 
The Cdc7-related kinases were grouped into a subset 
distantly related to other kinases in a phylogenetic tree 
(Figure 3B), indicating that Cdc7-related kinases belong 
to a distinct kinase subfamily. Unexpectedly, huCdc7 and 
xeCdc7 did not carry a C-terminal acidic region, which 
was present and essential for the activity in the yeast 
genes (our unpublished data). In budding yeast Cdc7, the 
C-terminal tail is involved in interaction with Dbf4 protein 
(Patteron et al, 1 986), We recently discovered that efficient 
interaction with Dbf4 protein requires the kinase insert II 
and III sequences of Cdc7 (our unpublished data). There- 
fore, a putative 'activator' for huCdc7 may well interact 
with the two kinase insert sequences in the absence of the 
C-terminal tail. An alternative possibility is that an as yet 
identified variant of Cdc7-related kinases, which does 
contain a C- terminal region, may be present in higher 
eukaryotes. 

Expression of huCdcl in various tissues and in the 
cell cycle 

mRNAs for huCdc7 are expressed in most tissues 
examined (Figure 5), as expected from its essential func- 
tion for cell proliferation, although the highest expression 
was detected in testis. This may be interesting in the light 
of a previous report (Sclafani et aL, 1988) and our 
unpublished observations that mRNAs for budding yeast 
CDC7 and fission yeast hskl^ are induced during the 
course of meiosis, suggesting that Cdc7-related kinases 
may play additional roles during meiosis. Characterization 
of budding yeast Cdc7 mutants indicated that Cdc7 is 
required for synaptonemal complex formation during mei- 
osis (Sclafani et aL, 1988). A huCdc7 transcript was also 
detected in brain. The functions, if any. of huCdc7 in 
mature neuronal cells which do not proliferate remain to 
be investigated. 

Among the three transcripts detected, the 3.5 kb tran- 
script was ubiquitously present in all the tissues, whereas 
the 4.4 kb transcript was seen in subsets of tissues, such 
as testis, peripheral blood leukocytes (PBL). thymus, 
spleen, small intestine, brain and placenta. In various 
human cell lines, the former was the major transcript and 
the latter was expressed at a lower level (data not shown). 
The 2.4 kb transcript was detected only in testis, suggesting 
that it may be specific to this tissue. Northern analysis 
using the C-terminai region of huCDC7 containing only 
the kinase domain XI as a probe (probe B) showed that 
the 2.4 kb transcript did not hybridize with this DNA 
segment (Figure 5). In accordance with this observation. 
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we have oblained from a testis library a variant cDNA 
whose coding frame is truncated at amino acid position 
519. A mouse cDNA for Cdc7-related kinase that we have 
isolated from a, spermatocyte hbrary contained the kinase 
subdomains l-VII. but its coding region was truncated in 
kinase insert 11 and continued into unrelated sequences 
which were identified on the mouse genomic DNA 
upstream of the remaining kinase domains (VII-XI), 
suggesting that these cDNAs are products of ahemative 
splicing (our unpublished data). On the other hand, clone 
#8 contained a 91 bp insertion at the N-terminal coding 
region, thus resulting in frameshifting in translation. This 
insertion occurs at an exon-intron junction of the murine 
CDC7 gene (our unpublished data). At present, we do not 
know the functions of these apparently kinase-inactive 
derivatives of Cdc7-related proteins. 

In lysates from human cell lines such as HeLa, TF-I 
and K562, huCdc7 protein was identified as a 68 kDa 
protein that co-migrated with the polypeptide expressed 
from an expression vector carrying the 574 amino acid 
long huCdc7 cDNA. The level of expression of the 68 kDa 
huCdc7 protein did not vary significantly during the course 
of the mitotic cell cycle (Figure 6B). Transcription of 
budding yeast CDC7 and fission yeast hski^ was also 
previously reported to be relatively constant during the 
cell cycle (Yoon et al, 1993; our unpublished data) 

buCdc7 IS a nuclear protein 

Indirect immunofluorescence staining using anti-huCdc7 
monoclonal antibody (4 A 8) showed that fluorescence was 
confined mostly to the nucleus during the interphase. 
huCdc7 was localized in nuclei before and after DNA 
replication is initiated, and continued to stay in nuclei 
during the entire interphase (Figure 7). Localization of 
huCdc7 in nuclei did not coincide precisely with that of 
PCNA, which is known to co-localize at the replication 
foci. Further analysis is needed to determine the precise 
subnuclear localization of huCdc7. huCdc7 protein could 
be extracted by high salt, but not by DNase I (Figure 
6C), suggesting that it may be associated with nuclear 
structures. 

Cdc7 may regulate MCM function by 
phosphorylation 

MCM appears to be phosphorylated at various stages of 
the cell cycle. The newly synthesized PI (mouse MCM3) 
is phosphorylated in the Gj phase, and the level of its 
phosphorylation increases during the S phase (Kimura 
€t al, 1994). Phosphorylation of Xenopus MCM4 (Cdc21) 
in early S phase was also reported (Coue et aL, 1996). 
Thus, phosphorylation of MCM may activate its functions 
for S phase initiation. Alternatively, phosphorylation of 
MCM may block re-replication by dissociating from 
the chromatin after the initiation of S phase. Specific 
phosphorylation of MCM subunits by huCdc7 (Figure 8) 
supports the idea that this kinase regulates DNA replic- 
ation. huCdc7 may regulate both activation of the S phase 
and *once and only once' replication through phosphoryl- 
ation of MCM subunits as well as that of as yet identified 
substrates. In budding yeast, it was shown recently that 
MCM2, 3, 4 and 6 could be phosphorylated by the Cdc7- 
Dbf4 kinase complex in vitro (A.Sugino and B.K.Tye, 
personal communication). Experiments are in progress to 



determine whether other MCM subunits are phosphoryl- 
ated by huCdc7 and to locate more precisely the phos- 
phorylation sites on MCM proteins. MCM functions 
could be regulated through sequential phosphorylation and 
dephosphorylation by multiple kinases and phosphatases. 
It would also be important to understand how phosphoryl- 
ation of MCM by a Cdc7-related kinase is coordinated 
with that by other kinases, which may include Cdks 
and DNA-dependent protein kinase, to achieve precise 
regulation of MCM functions for progression of the 
cell cycle. 

The in vitro kinase activity of S.cerevisiae Cdc7 is 
strictly dependent on the presence of Dbf4 protein, and 
Cdc7 alone expressed in insect cells is inactive. Similariy, 
huCdc7 alone expressed in insect cells did not show any 
phosphorylation activity (our unpublished data), sug- 
gesting the requirement for an activator for kinase activity 
of huCdc7. Although we were able to measure huCdc7- 
dependent phosphorylation after overexpression. we had 
difficulties in measuring the kinase activity of endogenous 
huCdc7 protein. This may reflect scarcity and/or instability 
of the active huCdc7 kinase complex in the cells. Identi- 
fication of the putative activator for huCdc7 will help 
to understand the precise roles of huCdc7-dependent 
phosphorylation in mammalian cell cycle progression. 

In summary, we report here the presence of Cdc7- 
related kinases in higher eiikaryotes and present data 
implicating this kinase in regulation of mammalian 
chromosomal replication. Further characterization of these 
Cdc7-related kinases should provide important insights 
into molecular mechanisms of cell cycle regulation of 
chromosomal replication in higher eukaryotes. 

Materials and methods 

Cells 

Mouse ES cells were cultured on a 0.1% porcine skin gelatin-coaled 
culture dish in high glucose Dulbecco's modified Eagle's medium 
(DMEM) containing 0.1 mM non-essential amino acids (Gibco). 2 mM 
L-glutamine (Irvine Scientific). 1 X nucleotide mix (3 mM each of dATP. 
dCTP, trip, dGTP and I mM dTTP), 50 mM 2-mercaptoeihanol 
(Sigma), the supernatant from human leukaemia-inhibiting factor (hUF> 
expressing COS? cells and 20% fetal calf senim (PCS). COS? and HeLa 
cells were cultured in DMEM containing 10% PCS. TP-1 cells were 
cultured in RPMJ 1640 containing 2 ng/ml hGM-CSP and 10% fCS. 
K562 cells were cultured in RPMI 1640 containing 10% PCS. 

Oligonucleotides 

The oligonucleotides used for degenerate PGR amplification for cloning 
of CDCTZ-related kinases were designed on the basis of the amino acid 
sequences conserved between the producu of S.cerevisiae CDC7 and 
S.pombe hskl^. The combination of the oligonucleotide primers that led 
to the isolation of a mammalian Cdc?-related kinase was 5'-CXjGAATT- 
CAA(AG)AT(TCA)AA(AG)GA(TQAA(AG)AT-3' and 5'-CGGAATT- 
aGaA(GA)ICC(AG)AA(AG)TC(ATGQAC-3\ corresponding to the 
amino acid stretches from 34 to 39 and from 186 to 181, respectively, 
of Cdc7 protein. The other nested combination was 5'-CGGAA- 
TTCAA(AG)AT(TCA)GG(TCGA)GA(AG)GG(TCGA)AC-3' and 5'- 
CGGGATCCIGG(TQTT(AGT)AT(AG)TC(TCGA)C(TG)(AG)TG - 3', 
corresponding to the amino acid stretches from 38 to 43 and from 166 
to 161 (Patteron et al., 1986). 

Antibodies 

Portions of the huCDCT? coding frame (amino acid residues 128-276 
and 128-433), isolated as Sau3A fragments, were subcloned at the 
BamH] site of pGEX-3X to generate GST fusion proteins #1 and #2, 
respectively, which were purified as previously described (Ikeda ct ai, 
1996a,b). Polyclonal antibodies #1 was developed in rabbit against the 
purified fusion protein #1 and the antibodies reacting to the GST portion 
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of the fusion protein were depleted by glutathione-Sepharose 4B resin 
to which non- fused GST protein was attached. Mouse monoclonal 
antibody {4 A 8) was developed against the GST-huCE)C7 fusion protein 
#2. Ami -rat PCNA antibody was purchased from MBL (Nagoya, Japan) 
and anii- mouse Cdc21 rabbit serum was kindly provided by Dr H.Kimura 
(Hokkaido University, Japan). Anti-BrdU antibody containing DNase I 
was purchased from Amersham. Anti-HA antibody 12CA5 was purchased 
from Babco (CA). 

GST~MCM fusion proteins 

GST-human P I (MCM3) (amino acid residues 1 -808) and GST-Xenopus 
MCM2 (amino acid residues 1-559) were gifts from Eh" H.TaJusawa 
(Osaka University, Japan). 

mRNA isolation and reverse transcription 
PoIy(A) RNA was isolated from mouse ES cells using a FAST TRACK 
mRNA isolation kit (Invitrogen). Reverse transcription was perfoimed 
with Superscript H (Gibco) as suggested by the manufacturer. One ^1 of 
the 20 ^1 reaction mixture was used for subsequent PGR amplification. 

PCR screening and subcloning of amplified fragments 

PGR, isolation and subcloning of amplified DNA fragments were 
performed as described earlier (Masai et ai, 1 995). The isolated fragments 
were subcloned into the T vector prepared from KS (pBIuescript) vector 
(Marchuk et a/., 1991). Plasmid DNAs containing insert DNAs were 
recovered from white colonies of DH5a on LB plates containing 
ampicillin (50 jig/ml) and Xgal (40 ng/ml), and the nucleotide sequences 
of the inserts were determined. One of the clones carrying an insert of 
-420 bp contained a coding frame that resembled a part of Cdc7 and 
Hskl, which was designated pKS-420. 

Screening of cDNA libraries 

A human fetal liver cDNA lambda library was kindly provided by Dr 
Soma (Kirin Brewery Co.. Japan) and a human testis cDNA lambda 
library was purchased from Ciontech. Xenopus oocyte cDNA lambda 
libraries were kindly provided by Dr Douglas Melton and Eh- Tun Hunt. 
A total of 10* plaques from each library were screened with the '^P- 
labelled 414 bp PCR-ampIified DNA fragment isolated from pKS-420. 
Clones #8 and #13 from the fetal liver library and #1 from the testis 
library carried 1 .8. 2.9 and 0.8 kb cDNA inserts, respectively, and were 
analyzed further. Similarly. #25, #28 and #23, which carried a 1.3, 1.3 
and 1.35 kb cDNA insert, respectively, were isolated firom the Xenopus 
libraries. These inserts were subcloned into KS vector and the entire 
nucleotide sequences were determined. 

Northern and genomic Southern analysis 
Human multiple tissue Northern blots and zoo blot (£coRl digestion) 
were purchased from Clontech. '^P-labelled DNA probes were prepared 
by random priming reactions on the mixture of the 0.7 kb and 1.1 kb 
EcoRl fragments derived from the huCTXT? #8 clone containing amino 
acids 1-511. Hybridization was carried out in a buffer containing 6x 
SSPE, 10% formamidc, 5x Denhardt's. 0.1% SDS and 100 ^g/ ml heat- 
denatured sonicated salmon sperm DNA (Sigma) at 42°C for 12 h. For 
cross-species hybridization, the filters were washed at 42X in 6x SSC 
followed by washing in 2x SSC at the same temperature. For more 
stringent hybridization, they were washed further in 0.2X SSC at 42°C 
and finally at 50°C in 0. 1 x SSC. The washing buffer contained 0. 1 % SDS. 

Construction of vectors that express full-length wild-type 
and mutant forms of buCDC? 

The 0.7 (N-terminal) and 1.1 kb (C-terminal) EcoKl fragments constitut- 
ing the 1 .8 kb insert of #8 cDN A (lacking a portion of the C-tcrminus 
coding region) were subcloned into KS vector, resulting in KS(0.7) and 
KS(l.l), respectively. The N-terminal coding region missing in the #13 
cDNA was reconstmcted by replacing the 0.25 kb Sacl-Bc!l fragment 
of #13 with that (0.6 kb) of KS(0.7). generating KS-huCDC7-#6 which 
contained the entire coding frame of huCDCT? with a 91 nucleotide 
insertion al amino acid position 39 as well as the 3' -untranslated region. 
The spliced form containing the entire coding region in-frame was 
constructed by replacing the Age\~Kas\ fragment of KS-huCDC7-#6 
(containing the 91 nucleotide insert) with that of lesiis-derived #1 cDNA 
(without the insert), resulting in KS-fulI huCDC7. The N-terminal Notl- 
Age\ fragment of the KS-full huCDC7 was replaced by the fragment 
generated by Not\~Age\ digestion of a PCHl- amplified DNA, resulting 
in the HA-lagged 'short* huCdc7 coding frame (starting from amino 
acid position 13; KS-short huCDC7). This PCR was conducted by using 
an oligonucleotide containing Not\-Nde\ sites followed by the sequences 
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encoding the 10 amino acid HA peptide (MYPYDVPDYA) and the 
huCdc7 coding region (amino acids 13-17) in combination with an 
iniemai primer 5'-TTT GTC CTTC AAT CTT AAT CTT-3' present 
downstream of the Age\ site. The 2.3 kb Noi\-Xba\ fragment of KS-short 
huCDCr? containing the HA-tagged short form was cloned into pKU3 
(a gift from Dr Muto of our laboratory), a neomycin-resistant derivative 
of pMEI8S. resulting in pKU3-HA-short huCDC7 which expressed HA- 
laggcd 562 amino acid huCDC:7 under the SRa promoter. For constmction 
of a plasmid expressing the full-length huCDO. the 180 bp £c<?RI- 
A^^l fragment of KS-full huCDCT? containing the N-tcrminal huCDC? 
coding frame (from amino acid position 1 to 22) was further digested 
by A/uI (at 31 nucleotides upstream of the first ATG), and a Not\ linker 
was attached to this Alu\ site. This Not\-Age\ fragment replaced the 
same fragment of pKU3-HA-short huCDC7. resulting in pKU3-long 
huCDC? containing the 574 amino acid full-length huCDC? coding 
frame. To generate pKU3-short huCDCT, the same Not\-Age\ fragment 
of pKU3-HA-short huCDC7 was replaced by an oligonucleotide con- 
taining a Notl site followed by the sequence encoding 10 amino acids 
from position 13 to 22 of huCDCl, 

A mutant form of huCDCT/ in which the lysine at position 90 was 
replaced by arginine was constructed by the PCR method. Oligonucleo- 
tides 5'-AAAATTGCCTTAAGACACTTGATTCCAACA-3' and 5'- 
TCAAGTGT(nTAAGGCAATTTTCTCTTCAG-3' were used to create 
the mutation. 

Transfection of plasmid DNA into mammalian cells 

Plasmid DNAs were introduced into COS7 cells by clectroporaiion as 

previously described (Kitamura et ai, 1991). 

Cell synchronization and preparation of cell lysates 
HeLa cells were synchronized at the Gi/S boundary by double thymidine 
block as described (O'Connor and Jackman, 1995). Factor-depleted TF- 1 
cells were prepared by deprivation of hGM-C^F for 16 h and the Gj 
phase TF-1 cells were obtained by mimosine treatment as described 
(O'Connor and Jackman, 1 995). Cells were synchronized in metaphase by 
nocodazole treatment (O'Connor and Jackman, 1 995). Synchronization of 
the cell cycle was monitored by flow cytometry (Giunta and Pucillo, 
1995). Oils were washed with ice-cold phosphate-buffered saline (PBS) 
and were resuspended in IP buffer (50 mM HEPES/KOH (pH 7.6). 
150 mM NaQ, 2.5 mM EGTA, I mM EDTA, 1 mM dithiothreiiol 
(DTT). 0.1% Tween-20, 10% glycerol, 1 mM phenylmethylsulfonyl 
fluoride, 1 fig/ml aprotinin, 1 jig/ml leupeptin. 100 \iM p-glycerophos- 
phate, 10 mM NaF and 1 mM Na3V04] at 2XI0' cells/ml. After 
sonication of the cell suspension, lysates were cleared by centrifugation. 
Preparation of nuclear extracts and salt/DNase I extraction were con- 
ducted as described by Kimura (1994). 

Immunoprecipitaion, immunoblot and in vitro kinase assays 
Anti-huCDC? antibody #1 or anti-HA antibody (12CA5) was added to 
the cell lysate at a final concentration of 1 ^ig/ml and incubated for 2 h 
on ice. Immunoprecipitates were collected on protein A-agarose beads, 
washed three times with IP buffer. For inununobloi analysis, bound 
proteins were extracted by boiling with 20-40 jil of 2x Laenunli's 
sample buffer. Proteins were separated on an 8% SDS-PAGE and were 
blotted onto an Inmiobilon-P membrane (Millipore). The membranes 
were probed with anti-huCDC7 followed by incubation with horseradish 
peroxidasc-conjugated anti-rabbit IgG (Amersham). The immunoreactive 
proteins were detected by chemiluminiscence (ECL. Amersham. UK). 
For in vitro kinase assays, immunoprecipitates were washed further by 
pre-kinase buffer [40 mM HEPES/KOH (pH 8.0) and 40 mM potassium 
glutamate] and incubated in 24 |il of kinase reaction buffer [40 mM 
HEPES/KOH (pH 8.0), 40 mM potassium glutamate. 1 mM EGTA. 
8 mM magnesium acetate, 2 mM DTT, 0.5 mM EDTA, 0. 1 mM ATP 
and 2 nCi of (v'^PlATP] in the presence or absence of 1 jig of GST- 
fused MCM protein at 30**C for 20 min. The reaction was stopped by 
addition of 6 fil of 5X Laemmli's sample buffer. Samples were heated 
at 95°C for 5 min and were separated on a 10% SDS-PAGE- The gels were 
dried and phosphorylated proteins were delected by autoradiography. 

Indirect immunofluorescence 

Cells grown on coverslips were washed twice with PBS containing 
I n\M CaCl2 and 1.5 mM Mga2. Cells were fixed for 5 min with 3.7% 
formaldehyde and permeabilized with 0.2% Triton X-1(X) in PBS for 
5 min followed by incubation in blocking solution (0.2% gelatin. 5% 
FCS and 0.1% Tween-20 in PBS) when extraction of soluble proteins 
was not necessary. For BrdU staining, cells were incubated with 20 jiM 
BrdU for 20 min prior to the staining. In case soluble proteins needed 
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lo be exuactcd, cells were washed twice with PBS and once with CSK 
buffer (100 mM NaCI, 300 mM sucrose, 10 mM PIPES (pH 6.8) and 
3 mM Mfia2] (Fey ft at., 1984), permealized by 0.5% Triton X- 1 00 in 
CSK buffer and fixed with 3.7% formaldehyde at room temperature for 
5 min. The coverslips were incubated in blocking solution for 10 min 
and for a further \ h after addition of the first antibody (5 Mg/ml) and 
washed three times with the same solution. They were incubated further 
with a second antibody [rhodamine-conjugated goat anti-rabbit antibody 
or rhodamine-conjugated goat anti-mouse antibody (Immunotech, 
France) or fluorescein isothiocyanate- conjugated rabbit anti-mouse anti- 
body (Zymed)} which had been diluted 1:250 in blocking solution. 
Finally, the coverslips were washed three times with PBS and 
then visualized under the immunofluorescence microscopy (Nikon 
Opliphoi-2). 

Accession numbers 

The DDBJ/EMBL/GenBank accession numbers for huCDO and 
XeCDC7 reported here are AB(X)3698 and AB003699, respectively. 
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ABSTRACT 

We report the discovery, tissue distribution and pharmacolog- 
ical characterization of a novel receptor, which we have named 
H4. Like the three histamine receptors reported previously (H1 , 
H2, and H3), the H4 receptor is a G protein-coupled receptor 
and Is most closely related to the H3 receptor, sharing 58% 
identity In the transmembrane regions. The gene encoding the 
H4 receptor was discovered initially in a search of the GenBank 
databases as sequence fragments retrieved in a partially se- 
quenced human genomic contig mapped to chromosome 18. 
These sequences were used to retrieve a partial cDNA clone 
and, in combination with genomic fragments, were used to 



determine the full-length open reading frame of 390 amino 
acids. Northern analysis revealed a 3.0-kb transcript in rat testis 
and intestine. Radioligand binding studies indicated that the H4 
receptor has a unique pharmacology and binds pH]histamine 
(K^ = 44 nM) and pH]pyri!amine (K^ = 32 nM) and several psy- 
choactive compounds (amitriptyline, chlorpromazine, cyprohepta- 
dine, mianserin) with moderate affinity (K, range of 33-750 nM). 
Additionally, histamine induced a rapid internalization of HA- 
tagged H4 receptors in transfected human embryonic kidney 293 
cells. 



Histamine is a monoamine neurotransmitter thus far 
known to activate three G protein-coupled receptors 
(GPCRs), the HI, H2, and H3 receptors (HUl et al., 1997). 
Although molecular cloning has made possible the identifi- 
cation, isolation, and characterization of the majority of 
known GPCRs, the histamine receptor subtypes have proven 
more difficult to identify. The first two histamine receptor 
genes cloned were HI (Yamashita et al,, 1991) and H2 (Gantz 
et al., 1991). The identification of the H3 receptor came 
nearly a decade latter (Lovenberg et al,, 1999). Collectively, 
the HI, H2, and H3 receptors share less than 35% identity 
with one another and each has greater sequence identities 
with other aminergic receptors. Thus, the histamine receptor 
gene family is significantly divergent and may have evolved 
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from different ancestral genes (Leurs et al,, 2000). In addi- 
tion, there is evidence that multiple subtypes of the H3 
receptor may exist. Pharmacological studies performed on 
membranes extracted from rat brain tissue revealed two 
classes of H3 binding sites (West et al,, 1990; Leurs et al,, 
1996). However, efforts to clone a second H3 receptor subtype 
in the brain have thus far been imsuccessful. 

Through molecular cloning techniques, we have identified 
numerous novel GPCRs, including many subtypes not sus- 
pected to exist on the basis of pharmacology. Over the past 
decade, our cloning efforts have identified a nimiber of GPCR 
genes, including genes encoding such receptors as the cystei- 
nyl leukotriene CysLT2 (Heise et al., 2000), galanin GalR2 
and GalR3 (Kolakowski et al., 1998), thyrotropin-releasing 
hormone TRH-R2 (O'Dowd et al., 2000), uridine nucleotide 
receptor UNR (Nguyen et al., 1995), as well as a large cohort 
of orphan GPCRs for which the endogenous ligands remain to 
be elucidated (Marchese et al., 1999; Lee et al,, 2001). In a 



ABBREVIATIONS: GPCR(s), G protein-coupled receptor(s); HTGS, high-throughput genomic sequence; TM, transmembrane; NCBI. National 
Center for Biotechnology Information; PGR, polymerase chain reaction; HA, hemagglutinin; NEK, human embryonic kidney cells; Kb, kilobase pair. 
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scan of the GenBank high-throughput genomic sequence 
(HTGS) database, we identified a small DNA firagment that 
had greatest homology to the known histamine receptor 
genes. Here we report the discovery, tissue distribution, and 
pharmacological characterization of human DNA encoding a 
receptor, H4. H4 receptor mRNA had discrete and limited 
expression in rat testis and intestine. Although the H4 re- 
ceptor was most closely related to the H3 receptor [58% 
identity in the transmembrane (TM) regions], it seemed to 
possess a unique pharmacology, with highest affinity for 
psychoactive drugs (amitriptyline, chlorpromazine) with a 
tricyclic structure. 



Materials and Methods 

Database Searching. A customized search was used to query the 
GenBank HTGS database of high-throughput genomic sequences 
maintained by the National Center for Biotechnology Information 
(NCBI) with the sequences of various GPCRs, using the FAST.PAN 
program (Retief et al., 1999). 

Screening and Construction of the H4 Receptor Gene. Par- 
tial sequences encoding the H4 receptor were used to design primers 
to PCR-ampUfy H4-encoding fragments from human genomic DNA, 
A fragment encoding the start methionine to TM2 was ampHfied 
using primers PI (5'-ATGCCAGATACTAATAGCACAATC-3') and 
P2 (5'-CACAAAGAAGTCAGAGATGG-3') and another fragment en- 
coding from TM5 to TM6 using primers P3 (5'-TGGTACATCCTT- 
GCCATC-3') and P4 (S'-TATGGAGCCCAGCAAA.CAG-S'). PGR 
products were extracted with phenol/chloroform, precipitated with 
ethanol, and electrophoresed on a low-melting agarose gel. DNA in 
the expected size range were excised from the gel, ligated into the 
£;coRV site of the pBluescript vector (Stratagene, La JoUa, CA), and 
then the sequence was determined. H4 receptor-encoding fragments 
were used to screen human and rat genomic libraries (CLONTECH, 
Palo Alto, CA) and a human testis cDNA library (CLONTECH), as 
described previously (Marchese et al., 1994). Isolated phages from 
the human and rat genomic libraries were purified and subcloned, 
and the sequence was determined as described previously (Marchese 



et al., 1994). Isolated phages frx)m the human cDNA library were 
subjected to PGR amplification using primers specific for regions 
flanking the insert of the AgtlO library vector, subcloned, and se- 
quenced. 

A DNA fragment encoding the full-length human H4 receptor was 
amplified by PGR in three stages using isolated human genomic and 
cDNA Ubrary phage as templates. In stage 1, three overlapping 
fragments (A, B, and C), together encoding the full-length H4 recep- 
tor, were PGR-amplified as follows. Fragment A was obtained using 
primers P5 (5'-CATGATTTGATGTGATGGCA-3') and P6 (5'-GAAA- 
GGAATGGAGATGACAGCCACAAAGAAGTCAGA-3') erom an iso- 
lated genomic library phage obtained with the Met-TM2 encoding 
DNA probe. Fragment B was obtained using primers P7 (5'-GT- 
GATCTCCATTCGTTTG-3') and P8 (5'-TCCAATAAATATTCAT-3') 
from an isolated testis cDNA library phage. Fragment C was ob- 
tained using primers P9 (6'-TGGTACATCCTTGCCATG-3') and PlO 
(5'-GAGGTGAGAAAATTGTC-3') fr^m an isolated genomic library 
phage obtained with the TM5-6 encoding DNA probe. In stage 2, 
fragments B and C were joined by PGR using primers P7 and PlO. In 
stage 3, fragments A and B/G were joined together using primer P5 
and PlO to obtain a fragment with a length of -^1200 base pairs 
encoding the fiill-length H4 receptor. DNA encoding a hemagglutinin 
(HA) epitope tag (YFYDVPDYA) was inserted after the start methi- 
onine codon using PGR mutagenesis. This DNA was ligated into the 
EcoKV site of the mammalian expression vector pcDNA3 (Invitro- 
gen, Carlsbad, CA) and its sequence was determined. 

Northern Expression Analyses. mRNA from various human 
and rat tissues were extracted as described previously (Marchese et 
al., 1994). Briefly, poly(A)'^ RNA was isolated using oligo(dT) cellu- 
lose spin columns (Pharmacia, Uppsala, Sweden), denatured and 
size fractionated on a 1% formaldehyde agarose gel, transferred onto 
nylon membrane, and immobilized by UV irradiation. The blots were 
hybridized with ^^P-labeled DNA fragments encoding the human 
and rat H4 receptor, washed with 2X standard saline/phosphate/ 
EDTA and 0.1% SDS at 50**C for 20 min and with O.lX standard 
saline/phosphate/EDTA and 0.1% SDS at SO'C for 2 h and exposed to 
X-ray film at -70**G in the presence of an intensifying screen. 

Radioligand Binding Studies. For these studies, the HA- 
tagged H4 receptor was transiently transfected into human embry- 
onic kidney 293 cells into 100-mm plates using FUGENE-6 (Roche 
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Fig. 1. Alignment of four human his- 
tamine receptors HI, H2, H3 and H4. 
Residues identical among the recep- 
tors are shaded. Numeric amino add 
positions are indicated on the right. 
The presence of introns interrupting 
the H4 sequence are shown as "A". H4 
sequence data has been deposited in 
GenBank (accession number 
AY008280). 



Molecular Biochemicals, Rotkreuz, Switzerland) at a 6:1 ratio of 
FUGENE/DNA as deUiled previously (Shapiro et al., 2000). At 72 h 
after transfection, cells were harvested and membranes prepared as 
described previously (Shapiro et aL, 2000); membranes were stored 
as pellets at -SO'C until use. Radioligand binding assays were 
performed in a total volume of 500 /il in 50 mM Tris-Cl, 0.5 mM 
EDTA, pH 7.4, with 16 nM pHJpyrilamine in the 96-well format. 
After a 1-h incubation at room temperature, membranes were har- 
vested by rapid filtration with a Brandel Harvester followed by two 
quick washes of ice-cold binding buffer. After drying, filters were 
placed into scintillation fluid and quantified by liquid scintillation 
spectrometry. Data were analyzed using GraphPad Prism (Graph- 
Pad Software, San Diego, OA) and data presented represents the 
mean ± S.E.M. of computer-derived estimates of at least three sep- 
arate experiments, each done in duplicate. For inhibition studies, 12 
concentrations of unlabeled ligand spanning 6 log units of test drug 
were used; for saturation studies, six concentrations of labeled ligand 
spanning 3 log units were used. 

Immunoblot Analysis. The HA-tagged H4 receptor was tran- 
siently trans fected into COS- 7 monkey kidney cells (American Type 
Culture Collection, Manassas, VA) and membranes prepared from 
these cells as described previously (Lee et al., 2000). In brief, tissues 
were solubilized in sample buffer consisting of 50 mM Tris-HCl, pH 
6,5, 1% SDS, 10% glycerol, 0.003% bromphenol blue, and 10% 2-mer- 
captoethanol. The samples were subjected to polyacrylamide gel 
electrophoresis with 12% acrylamide gels and electroblotted onto 
nitrocellulose as described previously (Ng et al., 1996). HA-tagged 
H4 immunoreactivity was revealed with the 3F10 rat monoclonal 
antibody (Roche, Laval, Quebec). 

Internalization Studies. For these studies, HA-tagged H4 re- 
ceptors were transfected into HEK-293 cells as described above into 
100-mm plates. At 24 h after transfection, cells were split into 
24-well plates containing poly-lysine-coated glass cover slips using 
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Dulbecco's modified Eagle's medium containing 10% dialyzed fetal 
calf serum. The medium was replaced 24 h later with serum-free 
Dulbecco's modified Eagle's medium. The next day, cells were ex- 
posed to 100 ^i.M histamine for 0, 2, 5, or 15 min and then fixed with 
freshly prepared 4% paraformaldehyde in PBS for 30 min at room 
temperature. Cells were then permeabilized on ice (0.2% Triton 
X-100 in ice-cold PBS) for 20 min and then incubated with blocking 
buffer (5% nonfat dry milk, 2% bovine serum albumin in PBS) for 1 h 
and then incubated with a 1:2000 dilution of monoclonal anti-HA 
antibody in blocking buffer overnight at 4''C. The next day, after 
extensive washing in room temperature PBS, cells were incubated 
with secondary antibody (BODIPY-labeled goat anti-mouse; 1:250 
dilution in blocking buffer) for 1 h, washed extensively with PBS, and 
prepared for con focal microscopy as detailed previously (Berry et al., 
1996; Kristiansen et al,, 2000). Internalization was quantified as 
described previously (Berry et al., 1996; Willins et al., 1999). 

Results and Discussion 

As part of our ongoing search of novel genes encoding 
GPCRs, we queried the C^enBank sequence databases main- 
tained by NCBI with known GPCR sequences. A search with 
the histamine H3 receptor sequence retrieved partial se- 
quences of a novel GPCR-encoding gene in an unfinished 
sequence of a human contig mapped to chromosome 18 (Gen- 
Bank accession number AC007922). The retrieved sequence 
was obtained in three separate fragments, including one 
fragment that seemed to encode the receptor from the start 
methionine to TM2, another fragment encoding TM3, and a 
third fragment encoding TM5 through TM7 of a novel GPCR- 
like gene. These partial sequences were used to design prim- 
ers for PGR amplification of human genomic DNA. Two frag- 




Fig. 2. Representative schematic of 
the human H4 receptor embedded in a 
cell membrane (box). Transmembrane 
regions are numbered and depicted 
with a top (extracellular)/bottom (in- 
tracellular) orientation. iV-linked gly- 
cosylation sites are indicated in blue. 
Amino acids in red represent residues 
shared with the histamine receptor 
H3. 
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ments of DNA, encoding the start methionine to TM2 and 
TM5 to TM6, respectively, were obtained and used as probes 
to screen a human genomic library. The Met-TM2 probe 
retrieved two phages encoding the 5' end of the novel gene 
from the start methionine to TM2, whereas the TM5-TM6 
probe retrieved four phages encoding the 3' end of the gene 
from TM5 to the stop codon. 

Using the TM5-TM6 probe, we screened several tissues by 
Northern analyses, which revealed a 3.0-kb signal in rat 
testis, A human testis cDNA Ubrary was screened with both 
probes, which isolated two identical phages encoding the 
sequence from TM2 to TM6 of the gene. The sequences were 
identical in regions of overlap, confirming them to be partial 
sequences of the same gene. A comparison of the cDNA 
fragment to the GenBank genomic sequence revealed two 
introns interrupting the H4 receptor sequence. The first in- 
tron was -8 kb in length and located within the TM2-encod- 
ing region (interrupting the translated sequence "LNLAISD- 
FFVG. . . VISIPLYIPH"). The second intron was downstream 
in the region encoding the second intracellular loop (inter- 
rupting the translated sequence "DRYLSVSNA. . . VSYR- 
TQHTGV"). The length of this intron could not be deter- 
mined, because the GenBank genomic sequence was 
incomplete. 

A BLASTX search of the GenBank database with this 
novel DNA sequence revealed the greatest overall identity 
with the histamine receptor H3 (-40%) and, more distantly, 
with other amine-type receptors of the GPCR family (<30%). 
As also reported with the H3 receptor (Lovenberg et al., 
1999), this novel receptor shared greater identity with other 
amine receptors including the serotonin, adrenergic, dopa- 
mine, and muscarinic receptors (— 25%) than the histamine 
receptors HI or H2 (-20%). An alignment with the known 
histamine receptors (Fig. 1) revealed a higher sequence sim- 
ilarity to the H3 receptor (58% in the TM regions) than to the 
HI and H2 receptors (26 and 27% in the TM regions, respec- 
tively). It has been determined from their sequence similar- 
ities and phylogenetic analyses that the members of the 
histamine receptor family may have evolved from different 
ancestral genes and, through convergent evolution, acquired 
the residues to recognize and bind histamine (Leurs et al., 
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2000). This novel receptor, which we have named H4, seems 
to be a novel member of the histamine receptor subfamily, 
with closest relation to the H3 receptor. The amino acid 
sequence of the H4 receptor revealed many conserved resi- 
dues and motifs found within the GPCR famUy (Fig. 2). 
Among these is an aspartic acid residue in TM 3 that is 
conserved in all cationic amine receptors and has been shown 
to be important in binding various amines to GPCRs (Sava- 
rese and Fraser, 1992). In addition, the human Met-TM2 
probe was used to screen a rat genomic Hbrary, which re- 
trieved a phage encoding the 5' end of the novel gene from the 
start methionine to TM2. This fragment shared 71% identity 
(83% in the TM regions) to the human H4 sequence, reveal- 
ing a rat H4 ortholog. 

The human Met-TM2 and TM5-TM6 probes were used in 
Northern analyses of various human and rat tissues. In the 
rat, the TM5-TM6 probe revealed a single transcript of 3 kb 
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Fig. 3. Northern blot analysis of the distribution of H4 mRNA in various 
rat tissues. Each lane contained 10 ^tg of poly(A)* RNA. 



Fig. 4. Saturation binding studies of [^Hlhistamine (A) and pHjpyril- 
amine (B) specific binding in membranes prepared from HEK-293 cells 
transiently transfected with H4 cDNA. Shown are typical results from 
experiment replicated three (A) and two times (B), with data representing 
mean of duplicate determinations. The curve represents the theoretical 
fit of the data A) (K^ = 44 nM; B„„ = 235 fmoVmg) and B) {K^ = 32 nM; 

- 437 fmol/mg). Nonspecific binding was determined with 100 fiM 
mianserin. 



in the testis (Fig. 3). The rat DNA fragment encoding from 
the start methionine to TM2 was used in Northern analyses 
of various rat tissues, revealing a 3-kb transcript in intestine 
(data not shown). The H3 and H4 receptors had significantly 
different mRNA expression distributions. H4 mRNA was de- 
tected in two peripheral tissues (with no detectable levels in 
brain or various peripheral tissues, including heart, stomach, 
small intestine, kidney, or liver). In contrast, H3 mRNA has 
been reported to be abundant in the brain (Lovenberg et al., 
1999). Thus, H4 is not likely to be the H3-subtype character- 
ized previously in brain tissue (West et al., 1990; Leurs et al., 
1996), which suggests that yet another histamine receptor 
subtype remains unidentified. 

To test the H4 receptor for pharmacological characteriza- 
tion, we constructed a full-length open reading frame by 
individually amplifying and joining the three H4-encoding 
fragments by PGR. The receptor was expressed in HEK-293 
cells, and a variety of tritiated ligands including [^Hjhista- 
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Fig. 5. Two sets (A and B) of competition binding of [^Hlpyrilamine with 
various ligands in membranes prepared from HEK-293 cells transiently 
transfected with H4 cDNA. Shown are typical results from competition 
binding experiments using 20 nM [^Hlpyrilamine and various concentra- 
tions of unlabeled ligands. Also shown are typical results obtained with 
untransfected HEK-293 cells studied in parallel (A). 
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mine (histamine receptor nonselective), pHlpyrilamine (HI 
receptor-selective), and pH]tiotidine (H2 receptor-selective) 
were tested against unlabeled ligands including mianserin, 
cyproheptadine, histamine, and clozapine. Specific binding 
obtained with pHJhistamine (44 nM) or pH]pyrilamine 
(5-20 nM) and mianserin (100 /iM) represented between 40 
and 75% of total binding. Saturation binding studies per- 
formed with pHlhistamine and pHJpyrilamine indicated re- 
spective values of 44 and 32 nM and respective 
values of 235 and 437 fmol/mg of membrane protein (Fig. 4). 
No specific pH] histamine or pH]pyrilamine binding was ob- 
served with untransfected HEK-293 cells. 

Competition binding studies with various selective and 
nonselective histaminergic compounds were demonstrated. 
Typical data are shown in Fig. 5 and summarized in Table 1. 
The highest affinities were for amitriptyline and chlorprom- 
azine, which are tricyclic compounds that have high affinity 
for the HI histamine receptor (see on-line database at: 
http://pdsp.cwru.edu/pdsp.asp). Doxepin, cinnarizine, and 
promethazine (Hl-selective antagonists) also displayed high 
affinity for the H4 receptor. Imetit (H3-selective agonist) and 
dimaprit (H2-selective agonist) had weak affinities for the H4 
receptor, whereas mianserin (an HI and H2 antagonist), 
cyproheptadine (a nonselective histamine/serotonin antago- 
nist), and clozapine (an atypical antipsychotic drug with high 
affinity for a large number of receptors) had moderate affin- 
ities. The pharmacological profile of the H4 receptor is dis- 
tinct from the histamine receptors (Table 1). 

An HA epitope tag-encoding sequence was inserted after 
the start methionine for Western blot visualization. This 
plasmid was transiently transfected into COS-7 cells. Immu- 
noblot analyses of membranes from these cells revealed high 
expression of the H4 receptor (Fig. 6A), with bands at 44 kDa, 
85 kDa, and higher molecular mass species. The 44-kDa band 
matched the expected mass of the unglycosyiated receptor 
and the 85-kDa band matched the glycosylated form. The 
bands >250 kDa represent oligomeric receptor species (Lee 
et al, 2000). The functional activity of the HA-tagged H4 
receptor was examined by measuring histamine-induced in- 
ternalization. As shown in Fig, 6B, exposure to 100 fjM 

TABLE 1 

Ligand affinities for the H4 receptor 

Data represent mean ± S.E. for two to four separate experiments. Twelve different 
concentrations spanning 6 log units of test drug were used to displace 15 nM 
(^Hlpyrilamine. 







-LogECso (Ki) 


Hill Coefficient 






nM 




Amitriptyline 


7.31 


± 0,26 (33.6) 


0.8 


Chlorpromazine 


7.13 


± 0.29 (50.2) 


0.7 


Doxepin 


6.79 


±0.13(105.9) 


0.6 


Cinnarizine 


6.73 


± 0.20 (141.6) 


0.7 


Promethazine 


6.71 


± 0.19 (150.2) 


0.6 


Cyproheptadine 


6.53 


± 0.09 (201,5) 


0.9 


Clemizole 


6.16 


± 0.12 (402.2) 


0.5 


Mianserin 


5.96 


± 0.40 (750.0) 


0.9 


Clozapine 


5.90 


± 0.22 (849.6) 


1.2 


Chlorpheniramine 


5.39 


± 0.16 (2910.0) 


1.1 


Histamine 


5,30 


± 0.04 (3442.3) 


0,5 


Imetit 


5,25 


± 0.44 (3795,6) 


0.2 


Pheniramine 


5.21 


± 0.47 (4184,0) 


0.2 


Dimaprit 


4,76 


±0.19(11812.7) 


0,4 


a-Methylhistamine 




>10,000 


Undetermined 


Cimetidine 




> 10,000 


Undetermined 
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histamine caused an internalization of H4 receptors from the 
plasma membrane to intracellular sites. Quantification re- 
vealed a rapid internalization of H4 receptors with signifi- 
cant internalization at 2 min after agonist exposure (Fig. 6C). 
These results indicated that the surface expression of the H4 
receptor is functionally regulated by histamine exposure in a 
time-dependent fashion. 

During the preparation of this article, other researchers 
(Oda et al., 2000) also reported the identification of a cDNA 
encoding a novel histamine receptor. This cDNA sequence 
varied from our sequence at three nucleotide positions, which 
translated into differences in three amino acids. Specifically, 
Alal38, His206, and Gln253 (as found in our sequence) were 
replaced by valine and two arginines, respectively. In the 
study by Oda et al., expression analyses revealed signals in 
peripheral blood leukocytes, small intestine, spleen and co- 
lon, and no expression in the brain. In addition, they reported 
histamine signaling through the novel receptor to be pertus- 



sis toxin-sensitive, suggesting a Gi/^ pathway of activation. 
For our receptor, we examined several second messenger- 
effector systems. We were not able to demonstrate H4 recep- 
tor-mediated inhibition of forskolin-stimulated adenylyl cy- 
clase, alterations in phosphoinositide hydrolysis, or mitogen- 
activated protein kinase phosphorylation (extracellular 
signal-regulated kinase 1/2 phosphorylation) in HEK-293 
cells. 

In conclusion, we report the discovery of a novel histamine 
receptor, H4. Previously, it was observed that histamine 
receptors shared greater sequence similarities with other 
biogenic amine-binding GPCRs than with one another. H4 
shared highest sequence similarity with the previously re- 
ported histamine H3 receptor. In combination with the HI, 
H2, and H3 receptors, this receptor, with its unique distri- 
bution and pharmacology, will undoubtedly provide further 
insight into the physiological functions and therapeutic ap- 
plications of this receptor family. 
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Fig. 6. A, immunoblot analysis of membranes from mock-transfected COS-7 cells (lane 1) and COS-7 cells expressing the HA-H4 receptor (lane 2). 
Membrane protein (25 ^tg) was used in each lane. Arrows indicate the unglycosylated (bottom) and glycosylated (top) receptor. B) histamine-mduced 
internalization of the H4 receptor in HEK-293 cells. Shown are representative confocal micrographs in whidi IW.-tagged H4 receptors were whined 
in cells exposed to vehicle (PBS) or 100 ptM histamine for 5 min and then prepared for microscopy as descnbed previously (Beny et al., 1996; WiUins 
et al 1999; Kristiansen et al., 2000), Arrowheads and arrows indicate cell-surface and internalized receptors, respective y. C, histamme-mduced 
Umeiependent internalization of the H4 receptor in HEK-293 cells. Shown are the mean ± S.E.M. of the percentage mtern^ization of HA-tag^ H4 
receptors in HEK-293 cells (n = 20-30 cells/time point) after exposure to histamine (100 fiM) for various fame penods. ♦♦♦significantly different 
compared with O-min value (P < 0.0001). 
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APPENDIX G 



Cancer Genetics 



First p53 Relative May Be a 
New Tumor Suppressor 



p73a 



p73p 



p53 



In the 20 years since its discovery, the p53 
gene has become one of the most heavily 
scrutinized genes in history. Indeed, it's ref- 
erenced in over 8000 papers in Medline, 
the online biomedical abstract ser- 
vice. The fascination is easy to un- 
derstand: Loss or inactivation of p53, 
which is a so-called tumor-suppressor 
gene, is thought to contribute to the 
development of 50% of . all human 
cancers. All that time, p53 was thought 
to be an only child, with no close 
relatives. Now, researchers have dis- 
covered a new gene, a long-lost cousin 
called p73, that bears a strong resem- 
blance to p53. 

It is being greeted with the same 
surprise as any newfound relative. 
"Given the intense interest in this area, 
the fact that {p73] slipped through the 
cracks is surprising," says cancer biolo- 
gist Tyler Jacks of the Massachusetts 
Institute of Technology (MIT), But the 
new gene should generate some intense 
interest of its own, because its protein 
not only resembles the p53 protein, but 
also seems to have similar activities. 
The p53 protein acts as a "security guard," 
deployed when a cell's DNA is damaged to 
prevent the cell from becoming cancerous. 
It does this by either inhibiting cell growth 
until the damage is repaired or causing the 
cell to commit suicide through a process 
called programmed cell death or apoptosis. 
The p73 protein appears to share these growth- 
inhibiting and apoptosis-promoting effects, 
although what triggers them and exactly what 
its cellular role is are both unknown. 

Those findings, together with f)73's loca- 
tion in a region of chromosome I that is often 
deleted in cancers including neuroblastoma, 
a malignant tumor of nervous tissue, suggest 
that it, too, may be a tumor suppressor. "TThis 
(discovery] will titillate a whole lot of 
people," predicts cancer geneticist Bert 
Vogelstein of Johns Hopkins University 
School of Medicine, a pioneer of p53 re- 
search. Indeed, if p73 can stand in for p53 
when that gene is lost, it might be possible to 
design new cancer drugs that work by turning 
on p73 in tumors lacking p53. 

Molecular biologist Daniel Caput and his 
colleagues at the pharmaceutical company 
Sanofi Recherche in Labege, France, identi- 
fied the p73 gene while looking for something 
completely different, namely genes that re- 
spond to certain immune system regulators. 



When the French team sequenced the many 
potential targets their screen had turned up, 
they were shocked to find that one false posi- 
tive had remarkable similarities to p53. 



in squid that was supposed to be that species's 
version of p53 is actually more similar to p73 . 

The structural similarities between p53 
and p73 also suggested that the proteins 
might have similar roles in the cell. So Caput 
and his colleagues joined forces with their 
longtime collaborator, cell biologist Frank 
McKeon, who studies gene expression and 
cell division at Harvard Medical School in 
Boston, to look for parallels. One way p53 
restrains cells that have damaged DNA is by 
triggering the production of a protein called 
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No longer alone. The p73 proteins, although longer, resemble p53 in three regions: the transcription ac- 
tivation (TA, 29% Identical) and DNA binding domains (63% identical) and also the domain where p53 
binds itself (OLIGO, 38% identical). The labeled amino acids indicate residues that are frequently mu- 
tated in p53 and are conserved in p73. 



As the researchers report in the 22 Au- 
gust issue of Cell, the proteins made by p73 
are somewhat larger than p53. But they 
found that one section of p73 closely re- 
sembles the so-called "core binding region" 
of p53. Many of p53's activities depend on its 
ability to regulate other genes, and the core 
binding region is where the protein attaches 
itself to the DNA when exerting its effects. 
Of 177 amino acids in that region, 1.12 are 



"This [discovery] will 
titillate a whole lot of 
people." 

— Bert Vogelstein 



identical. Additional similarities turned up 
in two other sections thought to be involved 
in p53 activity — one needed for its gene 
regulatory effects and another where it ap- 
parently binds to itself. These resemblances 
are enough to suggest that the two genes are 
the progeny of a gene that was duplicated in 
some ancient cellular event. Indeed, p73 may 
be the ancestral gene, because a gene found 



p2I, which blocks cell division. The Caput- 
McKeon team found that adding p73 to a 
line of neuroblastoma that lacks the gene 
also triggers p2I production, an indication 
that p73 inhibits cell growth through the 
same pathway used by p53. 

In a paper that appears in this week's issue 
of Nature, molecular biologist William Kaelin 
at the Daiia-Farber Cancer Institute in Boston 
and his colleagues report similar findings with 
another tumor cell line. Kaelin*s team also 
found evidence that p73 can mimic p53's abil- 
ity to cause cell suicide. When overexpressed 
in these cells, p73 latched onto stretches of 
DNA to which p53 normally attaches itself 
when instructing a cell to self-destruct. 

Together, the findings suggest that p73, 
too, may be a tumor suppressor, an idea that is 
buttressed by its provocative chromosomal lo- 
cation. The Caput- McKeon team found p73 
in a region near the tip of chromosome 1 that 
was already suspected of harboring one or 
more tumor suppressor genes, because the 
region is often missing in tumor cells. 

The teams did find one major point of 
difference between the two genes, however. 
Unlike p53 protein, p73 does not seem to be 
produced in response to DNA damage. That 
implies that the protein is not a cell "security 
guard" the way p53 is. Early results of experi- 
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ments in which McKeon and Caput deleted 
the p73 gene in mice suggest another possi- 
bility: It may be "developmentally impor- 
tant," he says, especially in the brain and 
immune system, although how remains to 
be clarified. 

If p73 is a tumor suppressor, it may be- 
have somewhat differently than p53 and 
other previously discovered tumor suppres- 
sors. Classic tumor-suppressor genes require 
two "hits" to be inactivated— a partial or 
complete deletion of one of the two gene 
copies, for example, and another, lesser change 
that cripples the second copy. But Caput. 
McKeon, and their colleagues have evidence 
that one p73 copy is already inactive m nor- 
mal cells— the apparent result of a mysterious 
process called imprinting. Its precise function 
isn't known, but during embryonic develop- 
ment, imprinting alters certain genes so that 
the copy inherited either from the mother or 
the father is specifically shut down . 

If one p73 copy has been silenced by im- 



printing, then only one hit— loss of the ac- 
tive copy— might be all that it takes to tip a 
cell into the uncontrolled growth of cancer. 
Says Kaelin, "p73 may be the first example of 
a new paradigm for how tumor-suppressor 
genes are involved in cancer." 

Indeed, molecular biologist Rogier Versteeg 
of the Academic Medical Center in Am- 
sterdam, the Netherlands, has evidence that an 
imprinted gene may be involved in neuroblas- 
toma development. He has identified two 
sites of chromosome damage that contribute 
to neuroblastoma by knocking out as-yet- 
undiscovered tumor-suppressor genes. Both 
lie in the same region of chromosome 1 where 
p73 is located, and one illustrates "a strong 
bias" toward loss from the maternal copy of 
the chromosome in the cancer cells. This 
bias implies that this specific copy is the ac- 
tive one and must be lost to cause the cancer. 

Other work from the Caput-McKeon team 
suggests that this mystery tumor-suppressor 
gene may be p73. When they looked for the 



gene in neuroblastoma cell lines, they found 
that one p73 copy had been lost. And while 
they couldn't uncover any mutations in the 
remaining copy, most of the cell lines made no 
detectable p73 protein, implying that the sec- 
ond copy had been silenced by imprinting. 

In spite of the differences in the roles of 
p53 and its new cousin, both in normal cells 
and in cancer, the family resemblances may 
be strong enough for them to substitute for 
each other. If so, says MIT's Jacks, cancer 
might be treated by finding a way to switch 
on p73 in tumor cells that have lost p53. 
"Even if p73 is not normally involved in 
tumor suppression, maybe it could be re- 
cruited," says Jacks. Now McKeon and Caput 
are searching for further family members. 
But the discovery of p73 is already certain to 
captivate their peers. 

-Steven Dickman 



Steven Dickman is a free-lance uTiter in Cambridge, 
Massachusetis. 



.AIDS RESEARCH. 



HIV Gets a Taste of Its Own Medicine 



In an attempt to fight fire with fire, research- 
ers have engineered a virus that usually in- 
fects cattle to attack the AIDS virus in hu- 
mans. The innovative approach has so far 
shown promise only in test-tube experiments, 
but it is attracting widespread attention 
among AIDS researchers. "Its really on the 
verge of a breakthrough," says Nava Sarx'er, 
who oversees development of novel AIDS 
treatments at the National Institute of Al- 
lergy and Infectious Diseases (NIAID). 

Yale University virologist John Rose and 
co-workers describe in the 5 September is- 
sue of Cell how they have constructed a 
potential HIV treatment by modifying ve- 
sicular stomatitis virus (VSV), which farm- 
ers detest because it causes a mouth infec- 
tion in cattle that prevents them from eat- 
ir\g. As the Yale researchers' experiments 
show, their newfangled VSV selectively tar- 
gets and destroys HIV-infected human cells. 
"It*s a pretty interesting way of harnessing a 
virus for peaceful purposes," says the Uni- 
versity of Pennsylvania's Robert Doms. 'it's 
a very clever approach." 

The work builds on recent discoveries 
made by Doms and others about a two-part 
handshake between HIV and the cells it in- 
fects. After HIV binds to the CD4 receptor 
on a white blood cell, it also must link to 
another molecule found on the cell's surface, 
known as a chemokine receptor. Once these 
handshakes are complete, HIV gains entry, 
and shortly thereafter, new virus proteins 
make their way to the cell's outer coating, 
where they stick out like a flag of victory. 
Rose and colleagues reasoned that if VSV 



could be induced to express these receptors 
on its surface, they would bind to the HIV 
proteins displayed on infected cells, turning 
VSV into a kind of guided missile. To test 
this idea, the researchers stitched into VSV 
the genes that code for CD4 and one of HIV's 
favored chemokine receptors, CXCR4, and 
added their engineered VSV to a culture 




Trojan horse. CD4 and CXCR4 receptors ex- 
pressed by genetically engineered VSV bind to 
HIV's gp120 protein on surlace of infected cell. 

containing HIV-infected cells. The virus 
did, indeed, target just the infected cells, kill- 
ing them rapidly. "VSV is so fast," says Rose- 
much faster than HIV, he notes. 

A potential downside to this approach is 
that the modified VSV might kill cells that 
aren't infected by HIV. Rose believes that 
won't happen because he has stripped VSV 



of its own surface protein, which is what al- 
lows it to infect a broad range of cells. "With- 
out its normal coat, it can't infect anything," 
says Rose. But only animal tests will provide 
evidence of that, cautions NIAID director 
Anthony Fauci. 

Although Fauci has high praise tor the 
concept's ingenuity, he is concernoJ that it 
might take an impracticably high do^e ot the 
modified VSV to make a real dent in a 
person's HIV levels. Another worry, says 
monkey researcher Ronald Desrosiers of the 
New England Regional Prinrare Research 
Center in Southborough, Massachu<etrs, is 
that the body will quickly develop an im- 
mune response against V5V, limiting its 
ability to attack HIV. 

Still, Sarver, Fauci, and others are anx- 
ious for Rose and colleagues to put their viral 
guided missile to more stringent test-tube 
and animal tests. Desrosiers already has be- 
gun working with Rose to test the concept in 
monkeys that have been infected with SIV, 
HIV's simian cousin. Desrosiers expects to 
have results in the next few months. Even if 
they are positive, however, human trials will 
require the approval of the Food and Drug 
Administration, which has shown great cau- 
tion in the past about putting potentially 
therapeutic viruses into people. 

Rose's strategy is not limited to attacking 
HIV. NIAID's Sarver suggests that if re- 
searchers can swap different receptors into 
this "gutted" VSV, the precisely targeted vi- 
ruses could be used in everything from vac- 
cines to gene therapies to cancer treatments. 
"We're not there yet," says Sarver, "but the 
potential applications are enormous." 

-Jon Cohen 
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Summary 

Steroid hormones exert profound effects on differenti- 
ation, development, and homeostasis in higher eu- 
karyotes through interactions with nuclear receptors. 
We describe a novel orphan nuclear receptor, termed 
the pregnane X receptor (PXR), that is activated by 
naturally occurring steroids such as pregnenolone and 
progesterone, and synthetic glucocorticoids and anti- 
glucocorticoids. PXR exists as two isoforms, PXR.1 
and PXR.2, that are differentially activated by steroids. 
Notably, PXR.1 is efficaciously activated by pregneno- 
lone 16a-carbonitrile, a glucocorticoid receptor an- 
tagonist that induces the expression of the CYP3A 
family of steroid hydroxylases and modulates sterol 
and bile acid biosynthesis in vivo. Our results provide 
evidence for the existence of a novel steroid hormone 
signaling pathway with potential implications in the 
regulation of steroid hormone and sterol homeostasis. 

Introduction 

Steroid hormones are cholesterol derivatives that serve 
as signaling molecules to coordinate the expression of 
complex gene programs in higher eukaryotes. These 
molecules exert their effects by diffusing into cells and 
interacting with specific intracellular receptors. Recep- 
tors for each of the major classes of sex and adrenal 
steroids have been characterized (Evans, 1988; Beato 
et al., 1995; Mangelsdorf et al., 1995). In the absence of 
their cognate ligands, the steroid hormone receptors 
remain sequestered in the cytoplasm through interac- 
tions with large multiprotein complexes containing heat 
shock proteins. However, the binding of ligand causes 
the steroid hormone receptors to be released from these 
complexes and translocated into the nucleus (Pratt, 

# To whom correspondence should be addressed. 



1993). Once inside the nucleus, the activated receptors 
regulate the expression of target genes by binding as 
homodimers to short DNA sequence motifs, termed hor- 
mone response elements (HREs) (Glass, 1994). In this 
manner, the steroid hormone receptors function as li- 
gand-activated transcription factors. 

The molecular cloning of steroid hormone receptors 
revealed that they comprise a subfamily within a larger 
superfamily of structurally related proteins (Evans, 1988; 
Mangelsdorf and Evans, 1 995; Mangelsdorf et aL, 1 995). 
This superfamily also includes receptors for nonste- 
roidal, lipophilic molecules such as thyroid hormone, 
retinoids, fatty acids, and eicosanoids. The nonsteroid 
receptors differ from their steroid hormone receptor 
counterparts in several respects (Mangelsdorf and Evans, 
1995; Mangelsdorf et al., 1995). First, the nonsteroid 
hormone receptors are not sequestered in the cyto- 
plasm in the absence of their cognate ligands but in- 
stead reside within the nucleus. Second, whereas ste- 
roid hormone receptors generally bind to their HREs as 
homodimers, most of the nonsteroid hormone receptors 
identified to date bind to DNA as heterodimers with 
the 9-c/s retlnoic acid receptors (RXRs) (Glass, 1994; 
Mangelsdorf and Evans, 1995). Finally, the two classes 
of receptors recognize different types of HREs: steroid 
hormone receptors generally bind to HREs composed 
of two half-sites organized as a palindrome with a three- 
nucleotide spacer, while nonsteroid hormone receptors 
preferentially bind to HREs composed of two half-sites 
organized as a direct repeat (DR), with the number and 
composition of the nucleotides separating the half-sites 
serving as important determinants of receptor selectivity 
(Umesono et aL, 1991; Glass, 1994; Mangelsdorf and 
Evans, 1995). 

In addition to receptors with established ligands, ap- 
proximately 30 other members of the nuclear hormone 
receptor family have been isolated from vertebrates. 
These related proteins lack identified ligands and, as a 
consequence, have been termed orphan nuclear recep- 
tors (Evans, 1988; Mangelsdorf and Evans, 1995; En- 
mark and Gustafsson, 1996). The search for hormonal 
activators of the orphan receptors has created exciting 
opportunities to discover novel endocrine signaling 
pathways with implications in both normal physiology 
and disease. Work with orphan nuclear receptors has 
led to the identification of fatty acids and eicosanoids 
as ligands for the peroxisome proliferator-activated re- 
ceptors (Forman et al., 1997; Kliewer et al., 1997; Krey 
et al., 1997), retinoids and farnesoids as activators of 
RIP14/FXR (Forman et al., 1995; Zavacki et al., 1997), 
and various oxysterols as activators of the LXR and SF-1 
orphan nuclear receptors (Janowski et al., 1996; Lala et 
al., 1997; Lehmann et al., 1997). 

In this report, we describe a novel orphan nuclear 
receptor, which we have designated PXR for pregnane 
X receptor. Like nonsteroid hormone receptors, PXR 
binds as a heterodimer with RXR to an HRE composed of 
two half-sites organized as a DR. Surprisingly, however, 
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PXR is efficaciously activated by several steroids, in- 
cluding naturally occurring pregnanes and synthetic glu- 
cocorticoids and antiglucocorticoids. Thus, PXR com- 
bines features of both the steroid and nonsteroid nuclear 
receptors. We suggest that PXR defines a novel steroid 
hormone signaling pathway that may account for at least 
some of the effects of synthetic glucocorticoids and 
antiglucocorticoids that do not appear to be mediated 
through the classical glucocorticoid receptor (GR) sig- 
naling pathway. 

Results 

Cloning of PXR.1 and PXR.2 

In an effort to identify new members of the nuclear re- 
ceptor family, we performed a series of motif searches 
of public EST databases. These searches revealed a 
clone from a mouse liver library in the Washington Uni- 
versity/HHMI EST database that had homology to the 
ligand-binding domains (LBDs) of a number of nuclear 
receptors. We used this partial sequence information to 
isolate larger clones from a mouse liver cDNA library. 
The nucleotide sequence of the longest cDNA clone 
encodes a novel orphan nuclear receptor of 431 amino 
acids that we have designated PXR.1 (Figure 1A). We 
also isolated a second cDNA clone, termed PXR.2, that 
was identical to PXR,1 except for the deletion of a 
stretch of 123 nucleotides extending from base pairs 
661 to 783 (Figure 1A). Examination of the PX/? genomic 
structure revealed that represents a splice variant 

of PXR.1 lacking a single exon (data not shown). The 
PXR.2 cDNA encodes a 390 amino acid protein that 
lacks a 41 amino acid region in the putative LBD of 
PXR.1. Sequence alignment with nuclear receptors for 
which the crystal structures have been solved indicates 
that the 41 amino acids that distinguish the PXR iso- 
forms lie between helices two and three of the canonical 
LBD structure (Wurtz et al., 1996). 

Sequence comparison with other members of the nu- 
clear receptor family showed that the PXR isoforms are 
most closely related to the Xenopus laevis orphan nu- 
clear receptor 1 (0NR1) (Smith et al., 1994), with PXR.2 
and 0NR1 sharing 70% and 46% amino acid identity in 
their DNA-binding domains (DBDs) and LBDs, respec- 
tively (Figure IB). Based upon this degree of homology, 
it is unclear at present whether PXR represents the 
mammalian homolog of 0NR1. With regard to mamma- 
lian receptors, PXR is most closely related to the vitamin 
D receptor (VDR) (Baker et al., 1988), sharing 64% and 
39% identity in the DBD and LBD, respectively. 

PXR Expression Pattern in the Embryo 
and the Adult 

The expression pattern of PXR was examined in both 
adult and embryonic tissues. Northern blot analysis was 
performed under high-stringency conditions with blots 
that included poly(A)+ RNA prepared from multiple adult 
mouse tissues and a probe that recognized both PXR 
isoforms. Abundant expression of PXR mRNA was ob- 
served only in the liver and intestine, where three distinct 
messages were detected Including a highly expressed 
mRNA species of 2.6 kb and two transcripts of 1.9 kb 
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Figure 1 . PXR is a Member of the Nuclear Receptor Superfamily 
(A) Nucleotide and predicted amino acid sequence of mouse PXR. 
The upstream in-frame stop codon is in bold. The predicted start 
initiation codon is at nucleotide 151. The 123 nucleoUde region 
extending from base pairs 661 to 783 that is present in PXR.I and 
absent in PXR.2 is underiined. A putative polyadenylation signal in 
the 3' untranslated region is boxed. 

(6) Amino acid sequence comparison between murine PXR and 
other members of the nuclear hormone receptor family. Similarity 
between PXR and other nuclear honnone receptor family members 
in the DNA- and ligand-binding domains are indicated as percent 
amino acid identity. The 41 amino acid region that distinguishes 
PXR.1 from PXR.2 is indicated by the cross-hatched area. Al! com- 
parisons were made with the PXR.2 isoform. 0NR1, Xenopus orphan 
nuclear receptor 1; VDR, human vitamin D receptor; TRp, human 
thyroid hormone receptor RARa. human retinoic acid receptor a: 
GR. human glucocorticoid receptor. 

and 4.4 kb that were expressed at lower levels (Figure 
2A). Weaker expression of the PXR mRNA was also 
detected in kidney and stomach (Figure 2A). No PXR 
mRNA was detected in the other tissues examined. 




Figure 2. Expression Pattern of PXR in Adult and Embryonic Tissues 

(A) Northern blot analysis. RNA size markers (in kb) are indicated 
at left. Arrows at right indicate the three species of PXR transcripts 
that were detected. (B-D) In situ hybridization analysis of PXRmRNA 
expression in El 8 mouse embryo sections with a probe that recog- 
nized both PXR isoforms (see probe 1 in Experimental Procedures). 

(B) Phosphoimage of an entire sagittal section showing specific 
labeling in intestine and liver. Specificity was ascertained by compe- 
tition with a 100-fold excess of unlabeled, specific oligonucleotide. 
This resulted in complete inhibition of labeling in liver and intestine 
while nonspecific signals remained in developing bone. (C and D) 
High power bright-field microscopy showing the abundance of silver 
grains in the liver (C) and in the intestine (D) where mRNA labeling 
was confined to the epithelium. The bar in (B) corresponds to 2 mM, 
and in (C) and (D) to 50 [lM. 

The embryonic expression pattern of PXR was exam- 
ined via in situ hybridization analysis using sections pre- 
pared from day 18 (E18) mouse embryos. Three oligonu- 
cleotide probes were designed that recognized PXR, 
including two that interacted with both PXR isoform 
mRNAs and a third that hybridized only to PXR. 7 mRNA. 
PXR mRNA was detected in the liver and intestine (Fig- 
ures 2B-2D). Staining in the intestine was confined to 
the epithelium (Figure 2D). No PXR expression was de- 
tected in embryonic kidney, adrenal, lung, thymus, heart, 
skeletal muscle, brain, or spinal cord (data not shown). 
While similar results were obtained with all three PXR 
probes, the signal intensity in liver and intestine was 
consistently higher in experiments performed with the 
probes that recognize both PXR isoforms (data not 
shown), suggesting that PXR. 2 is expressed in tissues 
that also express PXR. 7. We conclude from the Northern 
blot and in situ hybridization analyses that PXR is abun- 
dantly expressed in only a few tissues, including the 
liver and intestine, in both the mouse embryo and adult. 



PXR Is Activated by Synthetic Pregnanes 
and Glucocorticoids 

We next sought to determine whether PXR, like other 
members of the nuclear receptor family, possesses tran- 
scriptional activity that can be regulated in a hormone- 
dependent manner. As we lacked knowledge of a cog- 
nate HRE for PXR, we initially performed searches for 
activators using an established chimera system in which 
the LBDs of the two PXR isoforms were fused to the 
DBD of the yeast transcription factor GAL4 (Lehmann 
el al., 1995). Expression vectors for the GAL4-PXR chi- 
meras were transiently transfected into CV-I cells to- 
gether with a reporter plasmid containing five copies of a 
GAL4 DNA binding site upstream of the chloramphenicol 
acetyltransferase (CAT) reporter. Transfected CV-1 cells 
were systematically treated with a series of natural and 
synthetic compounds that included steroids, vitamin D 
analogs, thyroid hormone analogs, retinoids, fatty acids, 
and other small, lipophilic molecules, and reporter levels 
measured. 

Interestingly, we found that the activity of the GAL4- 
PXR.1 chimera was markedly induced by 10 jxM con- 
centrations of a variety of synthetic steroids including 
the glucocorticoids dexamethasone, dexamethasone-t- 
butyl-acetate, and dexamethasone-21 -acetate, and the 
pregnenolone derivative 6,16a-dimethyl pregnenolone 
(Figure 38). Dexamethasone-t-butylacetate and 6,16a- 
dimethyt pregnenolone were the most efficacious of these 
compounds (Figure 38). Remarkably, we found that PXR. 1 
was not only activated by GR agonists, but also by the 
GR antagonists RU486 and pregnenolone 16a-carboni- 
trile (PCN) (Figure 38). RU486 is a potent antiprogestin 
that binds to the GR at nanomolar concentrations (Cade- 
pond et ah, 1997). PCN is a weaker antagonist that has 
been shown to interfere with GR-mediated activation 
at micromolar concentrations (Schuetz and Guzelian, 
1984; Schuetz et aL, 1984). Thus, the L8D of PXR.1 was 
efficaciously activated by both agonists and antagonists 
oftheGR. 

Notably, the GAL4-PXR.2 chimera displayed a much 
more restricted activation profile. Of the steroids that 
activated GAL4-PXR.1. only dexamethasone-t-butyl- 
acetate activated GAL4-PXR.2 efficiently (Figure 38). 
We conclude that the 41 amino acid deletion that distin- 
guishes PXR.2 from PXR.1 has a marked effect on the 
responsiveness of the orphan receptor to synthetic ste- 
roids. 

PXR Functions through a Response Element 
Conserved in the CYP3A Gene Promoters 
PCN and dexamethasone treatment have previously 
been shown to induce the expression of the CYP3A 
family of genes in rodent liver, intestine, and kidney 
as well as in primary cultures of rodent hepatocytes 
(Elshourbagy and Guzelian, 1980; Heuman et al., 1982; 
Hardwick et al., 1983; Schuetz and Guzelian, 1984; 
Schuetz et al., 1984; Gonzalez et al., 1985; Debri et 
al., 1995). The CYP3A genes encode cytochrome P450 
hemoproteins involved in the hydroxylation of steroid 
hormones, including corticosteroids, progestins, andro- 
gens, and DHEA-sulfate, as well as a variety of xenobiot- 
ics (Nebert and Gonzalez, 1987; Juchau, 1990). The re- 
sponse to PCN and dexamethasone occurs at the level 
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Figure 3. Synthetic Glucocorticoids and Pregnenolone Derivatives 
Activate PXR 

(A) Structures of the steroids that activate the GAL4-PXR chimeric 
proteins. 

(B) CV-1 cells were cotransfected with expression plasmids encod- 
ing either GAL4-PXR.1 (filled bars) or GAL4-PXR.2 (open bars) and 
the reporter plasmid (UAS)s'tk-CAT. Cells were treated with vehicle 
alone (0.1% DMSO) or 10 p-M of the indicated steroids. Cell extracts 
were subsequently assayed for CAT activity. Data represent the 
mean of five data points from two different experiments ± SD. 

of transcription and has been mapped to a conserved 
region within the CYP3A1 and CYP3A2qene promoters 
that does not contain a typical glucocorticoid response 
element, but instead contains a DR of the nonsteroid 
nuclear receptor half-site sequence AGTTCA separated 
by a three-nucleotide spacer, a so-called DR-3 motif 
(Figure 4A) (Umesono et al., 1991; Miyata et al., 1995; 
Quattrochi et al., 1995; Huss et al., 1996). These data 
have led to speculation that a nonsteroid nuclear hor- 
mone receptor might be involved in mediating the ef- 
fects of PCN and dexamethasone. 

Given that PXR shares a high degree of homology 
with 0NR1 and VDR in the DBD (Figure IB) and that 
both ONR1 and VDR preferentially bind to DR-3 HREs 
as heterodimers with RXR (Umesono et al., 1991; Smith 
et al., 1994; Mangelsdorf and Evans. 1995), we postu- 
lated that PXR might bind to the CYP3A1 and CYP3A2 
DR-3 motifs as a heterodimer with RXR. In order to test 
this idea, gel mobility shift assays were performed using 
a radiolabeled oligonucleotide containing the CYP3A1 
DR-3 motif and in vitro synthesized PXR.1, PXR.2, and 
RXRa. Neither PXR nor RXRa bound to the CYP3A1 
DR-3 alone. However, both PXR.1 and PXR.2 bound 
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Figure 4. PXR Binds as a Heterodimer with RXRa to DR-3 Response 
Elements Present in the CYP3A1 and CyP3A2 Gene Promoters 

(A) Alignment of DR-3 motifs present in the promoter regions of the 
CYP3A1 and CYP3A2 genes. A mutated derivative of the CYP3A1 
DR-3 motif (ACYP3A1) used in the gel mobility shift assays and the 
position of the mutations is also shown. 

(B) Gel mobility shift assays were performed with a radiolabeled 
oligonucleotide containing the CYP3A1 DR-3 and in vitro synthe- 
sized PXR.1, PXR.2, and RXRa as indicated. 

(C) Gel mobility shift assays were performed with PXR.1 and RXRa 
(top panel) or PXR.2 and RXRa (bottom panel) and radiolabeled 
oligonucleotide containing the CYP3A1 DR-3 in the presence of 
either a 5-fold or 25-fold excess of unlabeled oligonucleotides con- 
taining the CYP3A1 DR-3. a mutated CYP3A1 DR-3 {ACYP3Ar). or 
the CYP3A2 DR-3 as indicated. 



efficiently to the CYP3A1 DR-3 as heterodimers with 
RXRot (Figure 4B). The PXR-RXRa complex with DNA 
was competed efficiently by an excess of unlabeled 
CYP3A1 DR-3 oligonucleotide or an oligonucleotide 
containing a closely related DR-3 motif from the CYP3A2 
gene promoter (Figures 4 A and 4C). No competition was 
seen with an oligonucleotide containing a mutated 
CYP3A1 DR-3 motif (Figures 4 A and 4C). Thus, both 
PXR isoforms can bind specifically as heterodimers with 
RXRa to DR-3 motifs found in the promoter regions of 
CYP3A genes. 

We next asked whether the PXR isoforms could in- 
duce gene expression through the CYP3A1 DR-3 ele- 
ment in response to steroids. Transient transfection 
assays were performed with a reporter plasmid con- 
taining two copies of the CYP3A1 DR-3 motif inserted 
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Figure 5. Synthetic Glucocorticoids and Pregnenolone Derivatives 
Activate PXR through the CYP3A1 DR-3 

(A) CV-1 cells were cotransfected with expression plasmids for full- 
length PXR.1 or PXR.2 and the (CYP3A1)2'tk'CAT repotter plasmid. 
Cells were treated with either vehicle (0.1% DMSO) alone (open 
bars) or 10 jiM dexamethasone-t-butylacetate (filled bars). Cell ex- 
tracts were subsequently assayed for CAT activity. 
(B and C) CV-1 cells were cotransfected with expression plasmids 
for full-length PXR.1 (B) or PXR.2 (C) and the (CYP3A1)2-tk-CAT 
reporter plasmid. Transfected cells were treated with the indicated 
concentrations of 6,1 6a -drn ethyl pregnenolone (closed circles), dexa- 
methasone-t-butylacetate (open circles), PCN (closed squares), or 
RU486 (open squares). Cell extracts were subsequently assayed for 
CAT activity and data plotted as the percentage of the maximal 
response obtained. Data points represent the mean of assays per- 
formed in triplicate. Similar results were obtained in two independent 
experiments. 

upstream of the minimal thymidine kinase promoter and 
the CAT gene l(CYP3A1)rtk-CAJ]. Interestingly, in the 
absence of steroids, PXR.1 was found to have a roughly 
20-fold higher basal level of activity than PXR.2 (Figure 
5A). Nevertheless, both PXR isoforms were efficiently 



activated by dexamethasone-t-butylacetate (Figure 5A). 
Dose-response analysis revealed dexamethasone-t- 
butylacetale to be a significantly more potent activator 
of PXR.I than PXR.2. with EC50 values of 0.8 and 5 p.M 
for PXR.1 and PXR.2. respectively (Figures 5B and 5C). 
Consistent with the results obtained using the GAL4- 
PXR chimeras. 6.16a-dimethyl pregnenolone, PCN, and 
RU486 activated full-length PXR.1 on the CYP3A1 re- 
sponse element but failed to activate full-length PXR.2 
(Figures 5B and 5C). 6.16a-dimethyl pregnenolone was 
the most potent of these compounds, activating PXR.1 
with an EC50 value of 300 nM (Figure 5B). Based upon 
these data, we conclude that the full-length PXR iso- 
forms can activate gene expression through the CYP3A 7 
DR-3 motif in response to synthetic steroids. 



PXRs Are Activated by Naturally 
Occurring Steroids 

As PXR was activated by synthetic steroids, we exam- 
ined whether a naturally occurring steroid might serve 
as the endogenous hormone for PXR, Accordingly, CV-1 
cells were transfected with expression plasmids for 
either of the full-length PXR isoforms and the (CYP3A1)2- 
tk-CAT reporter and treated with a variety of natural 
steroids, including progestins, glucocorticoids, min- 
eraiocorticoids, androgens, estrogens, bile acids, and 
oxysterols. Both PXR.1 and PXR.2 were activated by 
micromolar concentrations of certain of these steroids. 
Consistent with our findings that the pregnenolone de- 
rivatives 6,16a-dimethyl pregnenolone and PCN activate 
PXR.1, pregnenolone and its metabolites 17a-hydroxy- 
pregnenolone. progesterone, 17« -hydro xyprogesterone, 
and 5(5-pregnane-3,20-dione activated PXR.1 (Figure 
6B). Dose-response analysis revealed that all five of 
these pregnanes activated PXR.1 with ECso values in the 
5-20 (J.M range (Figure 6C). Since PXR.1 is activated by 
synthetic glucocorticoids, we were surprised to find that 
naturally occurring glucocorticoids, including Cortisol 
and corticosterone. had virtually no effect on PXR.1 ac- 
tivity. (data not shown). 

In analogous cotransfection experiments performed 
with PXR.2, only 5p-pregnane-3,20-dione was found to 
induce reporter activity >5-foId (Figure 6B). Notably, 
pregnenolone, progesterone, and their 17a-hydroxyl- 
ated derivatives, which were efficacious activators of 
PXR.1, had little or no activity on PXR,2 (Figure 6B). 
Thus, while both PXR isoforms are activated by naturally 
occurring pregnanes, PXR.1 is activated by a broader 
range of these steroids than PXR.2. The generation of 
LBD isoforms with distinct activation profiles provides 
a novel mechanism for increasing the regulatory diver- 
sity in the PXR signaling pathway. 



Steroids Promote the Interaction of PXR 
with a Coactivalor Protein 

We next sought to address whether the steroids that 
activated PXR did so through direct interactions with 
the LBD. Due to the lack of radiolabeled derivatives 
of the more potent PXR activators, we were unable to 
perform standard binding analyses. However, a number 
of laboratories have recently demonstrated that ligands 
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Figure 6. Naturally Occurring Steroids Activate PXR 

(A) Structures of naturally occurring steroids that activate PXR. 

(B) CV-1 cells were cotransfected with expression plasmids for full- 
length PXR.l (filled bars) or PXR.2 (open bars) and the (CYP3A1)r 
tk-CAT reporter plasmid. Transfected cells were treated with 10 jiM 
of the indicated steroids. Cell extracts were subsequently assayed 
for CAT activity and data plotted as fold-activation relative to cells 
treated with vehicle (0.1% DMSO) alone. Data points represent the 
mean of assays performed in triplicate ± SO. 

(C) Transfected cells were treated with the Indicated concentrations 
of pregnenolone (closed circles), 1 7a-hydroxypregnenolone (open 
circles), progesterone (triangles), 17a -hydroxy progesterone (open 
squares), or 5p-pregnane-3,20-dione (closed squares). Cell extracts 
were subsequently assayed for CAT activity and data plotted as 
fold-activation relative to cells treated with vehicle (0.1% DMSO) 
alone. Data points represent the mean of assays performed in tripli- 
cate. Similar results were obtained in two independent experiments. 



for nuclear receptors induce their interaction with pro- 
teins required for optimal transcriptional activation, the 
so-called coactivator proteins (Horwitz et al.. 1996). 
These ligand-dependent interactions have been ex- 
ploited as a biochemical assay for demonstrating direct 
interactions between ligands and their cognate recep- 
tors (Krey et al., 1997). 

The steroid receptor coactivator protein-1 (SRC-1) 
has been shown to interact directly with both steroid 



and nonsteroid nuclear hormone receptors in a ligand- 
dependent fashion (Onate et al., 1995; HorwiU et a!., 
1996; Kamei et al., 1996; Takeshita et al., 1996). The 
Interaction of SRC-1 with nuclear receptors is depen- 
dent upon the amino acid motif LXXLL found in multiple 
copies in SRC-1 (Heery et al., 1 997; Torchia et al., 1 997). 
In an effort to determine whether the steroid activators 
of PXR serve as ligands for this orphan receptor, we 
tested whether a 14 kDa fragment of SRC-1 (SRC-1. 14) 
containing three LXXLL motifs interacted with PXR in a 
steroid-dependent manner. 

SRC-1, 14 was expressed in vitro and labeled with 
("Sl-methionine and I"S]-cysteine, and the LBD of 
PXR.1 was expressed in E. coli as a fusion protein with 
glutathione-S-transferase (GST). Coprecipitation exper- 
iments were performed in the presence of the most 
potent PXR activators including 6,16a-dimethyl preg- 
nenolone, dexamethasone-t-butylacetate, and PCN. 
["Sl-SRC-1.14 interacted only weakly with the GST- 
PXR.1LBD fusion protein in the absence of added com- 
pound (Figure 7 A). The interaction of ["S]-SRC-1.14 with 
GST-PXR.1LBD was significantly enhanced in the pres- 
ence of either dexamethasone-t-butylacetate, 6,16a- 
dimethyl pregnenolone, or PCN (Figure 7A). Additional 
experiments performed with 6,16a-dimethyl pregneno- 
lone and PCN revealed that these steroids promoted 
("S1-SRC-1.14/PXR.1 LBD interactions in a dose-depen- 
dent manner (Figure 7B). Consistent with the results of 
the transfection studies, 6,16a-dimethyl pregnenolone 
was more potent than PCN in promoting these interac- 
tions (Figure 7B). Estradiol did not stimulate interactions 
between PXR.1 and ["Sl-SRC-1.14, indicating that the 
enhancement was specific for compounds that activate 
PXR.1 in the transfection assay (Figure 7A). In control 
experiments, estradiol promoted the efficient interaction 
of ["Sl-SRC-1.14 with a GST-ERaLBD fusion protein 
(Figure 7A). However, GST-ERaLBD interactions with 
("S]-SRC-1 .14 were not induced in the presence of PCN, 
dexamethasone-t-butylacetate, or 6,1 6a-dimethyl preg- 
nenolone. Taken together, these data provide strong 
evidence that PCN, dexamethasone-t-butylacetate, and 
6,16a-dimethyl pregnenolone serve as ligands for PXR. 

Discussion 

Identification of a Novel Signaling Pathway 
for Steroids 

It was first observed over 50 years ago that repeated 
administration of high doses of certain steroids, includ- 
ing progestins and androgens, reduced their own and 
each others toxic effects. These early observations, to- 
gether with the later findings that resistance to numer- 
ous drugs is sex dependent, that castration of rodents 
leads to increased drug sensitivity, and that liver homog- 
enates of intact male rats metabolize certain drugs more 
rapidly than homogenates prepared from castrated ani- 
mals led to the concept of "catatoxic" steroids; that 
is, steroids that confer resistance to specific toxins by 
accelerating their metabolism (reviewed by Kourounakis 
et al., 1977). It was speculated that catatoxic agents 
might have utility in the treatment of patients suffering 
from either drug intoxication or from diseases caused 
by endogenous substances liable to metabolism (e.g., 
Cushing's syndrome) (Kourounakis et al., 1977). 
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Figure 7. Steroids Induce PXR Interactions with a Fragment of the 
Coactivator Protein SRC-1 

(A) Coprecipitation experiments were performed with bacterially ex- 
pressed GST-PXR.l LBD (upper panel) or GST-ERaLBD (lower panel) 
and ("Sl-SRC-1.14 synthesized in vitro. ("S]-SRC-1 .14 was mixed 
with either GST-PXR.l LBD or GST-ERaLBD in the presence of vehi- 
cle alone (1% DMSO) or 10 ^jlM of estradiol, PCN, dexamethasone- 
t-butylacetate. or 6,16a-dimethyl pregnenolone as indicated. ["S]- 
SRC-1.14 complexed with either GST-PXR.ILBD or GST-ERoLBD 
was precipitated with glutathione-sepharose beads as described in 
Experimental Procedures. A lane representing 10% of the input ["S]- 
SRC-1 .14 in each reaction is shown on the left. Western blot analysis 
with an anti-GST antibody revealed that comparable amounts of 
GST-PXR.l LBD and GST-ERaLBD fusion proteins were used in the 
assays (data not shown). 

(B) Dose-response analysis was performed with ("SJ -SRC -1.1 4 and 
GST-PXR.l LBD in the presence of the Indicated concentrations of 
6.1 6a -dimethyl pregnenolone (squares) or PCN (circles). ("SJ-SRC- 
1.14 was quantitated via scanning densitometry and plotted as a 
percent of the ("Sl-SRC-1.14 precipitated in the presence of 10 jiM 
6,1 6a -dimethyl pregnenolone. Data shown represent the average 
of duplicate points, and similar results were obtained in two separate 
experiments. 

A systematic analysis of steroids in the early 1970s 
identified the pregnenolone derivative PCN as the most 
potent catatoxic compound among those tested (Selye, 
1971). Insight into the mechanism underlying the cata- 
toxic effects of PCN was provided by the demonstration 
that this synthetic steroid induces the expression of 
CYP3A1 and CYP3A2. two closely related members of 
the P450 family of monooxygenases (Elshourbagy and 
Guzeliaa 1980; Heuman et al., 1982; Hardwick et al.. 
1983; Schuetz and Guzeliaa 1984; Schuetz etal., 1984; 
Gonzalez et al., 1985). The CYP3A hemoproteins have 
a remarkably broad substrate specificity, hydroxylating 
a variety of xenobiotics such as cyclosporin, warfarin, 
and erythromycin, as well as endogenous steroids in- 
cluding Cortisol, progesterone, testosterone, and DHEA- 
sulfate (Nebert and Gonzalez, 1987; Juchau, 1990). Sub- 
sequent studies with the cloned C/P5iA7gene promoter 
identified a PCN response element that was highly con- 
served in the CYP3A2gene promoter (Miyata et al., 1995; 



Quattrochi et al., 1995). This response element was com- 
posed of two copies of the nuclear receptor half-site 
consensus sequence AGTTCA organized as a DR. 

In addition to PCN, the expression of CYP3A1 was 
also shown to be induced by dexamethasone both in 
vivo and in cultured hepatocytes (Heuman etal., 1982; 
Schuetz and Guzeliaa 1984; SchueU et al., 1984). How- 
ever, the concentrations of dexamethasone required to 
induce CYP3A gene expression were higher than those 
typically required to activate the classical GR signaling 
pathway (Schuetz and Guzelian. 1984; Schuetz et al., 
1984). Promoter mapping studies showed that dexa- 
methasone induced CYP3A1 gene expression through 
the same DR response element as PCN (Quattrochi et 
al., 1995; Huss et al., 1996). Thus, paradoxically, both 
high concentrations of dexamethasone, a glucocorti- 
coid, and PCN, an antiglucocorticoid, induced the ex- 
pression of the CYP3A1 gene through the same re- 
sponse element. 

We now provide several lines of evidence indicating 
that the orphan nuclear receptor PXR is responsible for 
mediating the inductive effects of PCN and dexametha- 
sone on CYP3A gene expression. First, both dexametha- 
sone and PCN are efficacious activators of the PXR.1 
isoform. Second, PXR binds efficiently as a heterodimer 
with RXR to the conserved DR-3 motifs identified in the 
CYP3A1 and CYP3A2 gene promoters as glucocorticoid 
and PCN response elements. Finally, we detected PXR 
expression in only a few tissues, including liver, intes- 
tine, and kidney. These are the primary tissues in which 
the CYP3A genes are expressed and induced in re- 
sponse to either dexamethasone or PCN treatment 
(Heuman et al., 1982; Debri et al., 1995). Our data thus 
support the existence of a novel signaling pathway for 
synthetic glucocorticoids and provide a mechanistic ex- 
planation for the long-standing paradox as to how both 
GR agonists and antagonists can exert similar effects 
on C/P3>^gene expression. Moreover, the identification 
of PXR.1 as the PCN receptor provides the tool neces- 
sary for the rapid identification of novel pharmacological 
agents with more potent catatoxic activities. 

In addition to inducing CYP3A gene expression, PCN 
is also known to have marked effects on hepatic choles- 
terol homeostasis in rodents. These effects include sig- 
nificant decreases in the levels of HMG-CoA reductase 
and cholesterol 7a-hydroxylase gene expression with 
concomitant reductions in sterol biosynthesis and bile 
acid secretion (von Bergmann et al,. 1975; Mason and 
Boyd, 1978; Turley and Dietschy, 1984; Stahlberg, 1995). 
PCN has also been reported to enhance the formation of 
cholesterol-esters and the hypersecretion of cholesterol 
into the bile (von Bergmann et al„ 1975; Turley and 
Dietschy, 1984). Thus, PCN affects key aspects of cho- 
lesterol metabolism including its biosynthesis, storage, 
and secretion. Although we cannot exclude the possibil- 
ity that some of its biological effects might be mediated 
through the GR or other steroid receptors, it is tempting 
to speculate that PCN is mimicking the actions of an 
endogenous hormone that serves to regulate coordi- 
nately steroid and sterol metabolism through the activa- 
tion of PXR in tissues such as liver and intestine. Our 
data raise the possibility of the existence of regulatory 
loops through which endogenous PXR hormones feed 
back to regulate cholesterol homeostasis and feed for- 
ward to regulate steroid homeostasis. 
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What is the natural ligand for PXR? Pregnenolone 
and progesterone are among the most potent naturally 
occumng PXR activators that we have identified, acti- 
vating PXR.1 at low micromolar concentrations. Preg- 
nenolone is one of the most abundant steroids in 
mammals, circulating in humans and rodents at concen- 
trations that range from roughly 1 to 50 nM (Punjabi 
et al., 1983; Wichmann et al., 1984; Tietz, 1995). While 
progesterone levels in human serum are generally in 
the 1-10 nM range, concentrations can exceed 700 nM 
during the third trimester of pregnancy (Tietz, 1995). 
These levels approach those required to activate PXR.1 
in vitro. Nevertheless, it remains unclear whether con- 
centrations of progesterone or pregnenolone sufficient 
to activate PXR.1 are achieved either in serum or in 
tissues under normal physiological conditions. Our re- 
sults with PXR may thus be analogous to those obtained 
with R;<R, which was first shown to be activated by 
micromolar concentrations of all -trans retinoic acid 
(t-RA) prior to the identification of 9-c/5 RA as a high- 
affinity ligand (Mangelsdorf et al., 1990; Heyman et al„ 
1992; Levin et al., 1992). Based upon our findings that 
PXR is activated by pregnenolone and its metabolites 
and that the synthetic steroid 6,16a-dimethyl pregneno- 
lone activates PXR with an ECjo value of 300 nM, we 
suggest that the natural PXR ligand is likely to be a 
pregnane. 

Perspectives 

With the isolation of the androgen receptor in 1988 
(Chang et al., 1988; Lubahn et al., 1988), receptors for 
all of the known nuclear-acting steroid hormones had 
been cloned. However, studies performed during the 
past two years with orphan members of the nuclear 
receptor family have suggested that additional sterols 
are likely to serve as mammalian hormones. For exam- 
ple, the orphan receptors LXR and SF-1 were recently 
shown to be activated by physiological concentrations 
of several oxysterol metabolites of cholesterol (Janow- 
ski et al., 1996; Lala et al., 1997; Lehmann et al., 1997). 
While the biological role of LXR remains less clear, SF-1 
is essential for adrenal and gonadal development and 
regulates the expression of genes required for steroido- 
genesis (Luo et al., 1994). These data suggested an 
unexpected hormonal function for oxysterols in the reg- 
ulation of steroidogenesis. 

We have now identified PXR as a novel member of 
the nuclear receptor family that is efficaciously activated 
by both natural and synthetic steroids. The activation 
profile of PXR is distinct from any of the other steroid 
hormone receptors identified to date, suggesting that 
this orphan receptor defines a novel endocrine signaling 
pathway. We conclude that the identification of PXR 
provides additional evidence for an expanded role for 
steroid hormones in mammalian physiology and that the 
elucidation of the biological role of PXR is likely to lead 
to a better understanding of how steroids elicit their 
myriad effects. 

Experimental Procedures 

Chemicals 

Dexamethasone-t-butylacetale and RU486 were purchased from 
Research Plus. Inc. (Bayonne, NJ) and Biomol (Plymouth Meeting, 



PA), respectively. Ail other steroids were purchased from either 
Sigma Chemical Co. (St Louis, MO) or Steraloids, Inc. (Wilton, NH). 

Molecular Cloning of PXR cDNAs 

Partial mouse sequence for PXR was obtained from an EST depos- 
rted in the Washington University/HHMI EST daubase (accession 
number AA27 7 370). A 19 bp oligonucleotide derived from this EST 
sequence (5' TCTCCCCAGATCGTCCTGG 3') was used to screen a 
pCMV-SPORT mouse liver cDNA library (GIBCO-BRL) using Gene 
Trapper solution hybridization cloning technology (GIBCO-BRL)- 
Five clones were obtained that ranged in size from 1 .0 kb to 2.7 kb. 
Four of these clones encoded PXR.1 and one encoded the DBD 
and LBD of PXR.2. A clone encoding the full-length PXR.2 isoform- 
coding region (nucleotides 151 to 1443) was subsequently isolated 
from mouse liver cDNA through PCR. Sequences were aligned and 
analyzed by the University of Wisconsin Genetics Computer Group 
programs. 

Plasmids 

The expression plasmids pSG5-GAL4-PXR.1LBD and pSG5-GAL4- 
PXR.2LBD were generated by amplification of cDNA encoding 
amino acids 105-431 of PXR.1 or 105-390 of PXR.2 by PCR and 
insertion into a modified pSG5 expression vector (Stratagene) con- 
taining DNA encoding the DBD of GAL4 (amino acids 1-147) and 
the SV40 Tag nuclear localization signal (APKKKRKVG) inserted 
upstream of a multiple cloning site. The (OAS) ^Ak-C AT xepovXev plas- 
mid has been previously described (Lehmann et aL, 1995). The ex- 
pression vectors pSG5-PXR.l and pSG5-PXR.2 were generated by 
amplification of cDNA encoding amino acids 1-431 of PXR.1 or 
1-390 of PXR.2 and insertion into pSGS. The reporter plasmid 
(CYP3A1}2-tk-CAT \fjas generated by insertion of two copies of a 
double-stranded oligonucleotide containing the CYP3A1 DR-3 RE 
(5' GATCAGACAGTTCATGAAGTTCATCTAGATC 3') (Quattrochi et 
al., 1995; Huss etal., 1996) into the BamHI site of pBLCAT2 (Luckow 
and SchCitz, 1987). The bacterial expression vector pGEX-PXR.ILBD 
was generated by PCR amplification of cDNA encoding amino acids 
105-431 of PXR.1 and insertion into a pGEX-2T vector (Pharmacia) 
that had been modified to contain Kpnl and Not! restriction sites. 
The bacterial expression vector pGEX-ERaLBD was generated by 
PCR amplification of cDNA encoding amino acids 251-595 of human 
ERa (Green et al., 1986) and insertion into pGEX-2T. The expression 
plasmid for the SRC-1. 14 fragment was generated by PCR amplifica- 
tion of DNA encoding amino acids 632-754 of human SRC-1 (Take- 
shita et al.. 1996) and insertion into the expression vector pRSETC 
(Invitrogen). All constructs were confirmed by sequence analysis. 

Cotransfection Assays 

CV-1 cells were plated in 24-well plates in DME medium supple- 
mented with 10% charcoal-stripped fetal calf serum at a density of 
1.2 X 10* cells per well. In general, transfection mixes contained 33 
ng of receptor expression vector, 100 ng of reporter plasmid, 200 
ng of p-galactosidase expression vector (pCH1 10, Pharmacia), and 
J 66 ng of carrier plasmid. Cells were transfected overnight by lipo- 
fection using Lipofectamine (Life Technologies, Inc.) according to 
the manufacturer's instructions. The medium was changed to DME 
medium supplemented with 1 0% delipidated calf serum (Sigma) and 
cells were incubated for an additional 24 hr. Cell extracts were 
prepared and assayed for CAT and p-galactosidase activities as 
described previously (Lehmann et al., 1995). 

Northern Analysis 

An approximately 1.0 kb fragment encoding the LBD of PXR.1 (nu- 
cleotides 463-1446) was '^-labeled by random priming using the 
T7 Quick-Prime kit (Pharmacia). This radiolabeled fragment was 
used to probe mouse multiple-tissue Northern blots (OriGene, Rock- 
ville, MO). Blots were hybridized in ExpressHyb solution (Clontech, 
Palo Alto, CA) at 65*C for 18 hr. Final washes were performed with 
0.1 X SSC, 0.1% SDS at 6S*'C. 

In Situ Hybridization Analysis 

Embryonic day 18 (El 8) mice (C57bl/CBA and NMRI, Bomholt 
Breeding and Research Center, Copenhagen, Denmartc) were used 
in the studies. Tissues were sectioned at 1 4 jun and thaw-mounted 
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onto slides (ProbeOn, Fischer). Three different oligonucleotide 
probes (Pharmacia Biotech, Sollentuna, Sweden) designed to hy- 
bridize 10 PXR mRNA were used. Oligonucleotides were radiola- 
beled using ("SJ-deoxyadenosine 5'-a-thto-triphosphate (NEN) at 
the 3' end using terminal deoxynucleotidyl transferase (Amersham). 
Labeled oligonucleotides were hybridized to tissue sections and 
mRNA expression was Hrst detected by phosphoimaging (BASF 
3000 Phosphoimager, Fuji) followed by film emulsion autoradiogra- 
phy (Dagertind et al., 1992). Sections were examined using light- 
and dark-field microscopy (Axiophot. Zeiss) and photographed 
(Ektachrome 64T, Kodak). Positives were scanned (SprintScan 35, 
Polaroid) and processed using Photoshop and PageMaker, Adobe. 
The following oligonucleotide sequences were used as probes: 
probe 1 (PXR.1 and PXR.2 specific). 5' GGAGCCTCAATCTTTTCCC 
TCTTCTTCCTCTTGATCAAGGCCCGC 3'; probe 2 (PXR.1 specific), 
5' CTTCCAACAGTGAGGCCTGCAGAAACTCTGGAAGCTCACAGC 
CAC 3'; probe 3 (PXR.1 and PXR.2 specific), 5' TGGGCTCTTCCAAG 
GCAGAGTCTGCTTCT TC ACACTGTAC AAGGCC 3 '. A 50 bp random 
oligonucleotide was used as a negative control. 

Gel Mobility Shift Assays 

PXR.1, PXR.2, and RXRa were synthesized in vitro using the TNT 
rabbit reticulocyte lysate coupled in vitro transcription/translation 
system (Promega) according to the manufacturer's instructions. Gel 
mobility shift assays (20 contained 10 mM Tris (pH 8.0), 40 mM 
KCI, 0.05% NP-40. 6% glycerol, 1 mM DTT. 0.2 iig of poly(dl-dC), 
and 2.5 \s.\ each of in vitro-synthesized PXR and RXR proteins. The 
total amount of reticulocyte lysate was maintained constant in each 
reaction (5 (j,I) through the addition of unprogrammed lysate. Com- 
petitor oligonucleotides were included at a 5-fold or 25-fold excess 
as indicated in the figure legends. After a 10 min incubation on ice, 
10 ng of "P -labeled oligonucleotide was added and the Incubation 
continued for an additional 10 min. DNA-proiein complexes were 
resolved on a 4% polyacrylamtde get in 0.5 x TBE (1 y TBE = 90 
mM Tris, 90 mM boric acid. 2 mM EDTA). Gets were dried and 
subjected to autoradiography at -70"C. The following double- 
stranded oligonucleotides were used as either radiolabeled probes 
or competitors: CYP3A1, 5' GATGCAGACAGTTCATGAAGTTCATCT 
AGATC 3' (Quattrochi et al., 1995); ^CYP3Ah 5' GATCAGACAGAAC 
ATGAAGAACATCTAGATC 3': CYP3A2. 5' GATCAAACAGTTCATAA 
AGTTCATCTAGATC 3' (Miyata et al.. 1995). 

SRC-1.14 Coprecipitation Assay 

GST-PXR.1LBD and GST-ERaLBD fusion proteins were expressed 
in BL21(DE3)plysS cells and bacterial extracts prepared by one cycle 
of freeze-thaw of the cells in protein lysis buffer containing 10 mM 
Tris (pH 8.0). 50 mM KCI, 10 mM DTT. and 1% NP-40 followed by 
centrifugation at 40,000 x g for 30 min. Glycerol was added to the 
resulting supernatant to a final concentration of 10%. Lysates were 
stored at -SOX. ["S]-SRC-1.14 was generated using the TNT rabbit 
reticulocyte system (Promega) in the presence of Pro-Mix (Amer- 
sham). Coprecipitation reactions included 25 ^il of lysate containing 
GST-PXR.1LBD orGST-ERaLBD fusion proteins or control GST; 25 
[l\ incubation buffer (50 mM KCI. 40 mM HEPES fpH 7.5]; 5 mM 
p-mercaptoethanol; 1% Tween-20, 1% non-fat dry milk); 5 ji-l I"S]- 
SRC-1.14; and either PCN; dexamethasone-t-butylacetate; 6,16a- 
dimethyl pregnenolone; estradiol; or control DMSO. The mixtures 
were incubated for 25 min at 4"C with gentle mixing prior to the 
addition of 15 m-' of glutathione-sepharose 48 beads (Pharmacia) 
that had been extensively washed with protein lysis buffer. Reac- 
tions were incubated with gentle mixing at 4°C for an additional 20 
min. The beads were pelleted at 2000 rpm in a microfuge and washed 
3 times with protein lysis buffer containing either PCN; dexametha- 
sone-t-butylacetate; 6,16a-dimethyl pregnenolone; estradiol; or 
control DMSO. After the last wash, the beads were resuspended in 
25 M.I of 2 X SDS-PAGE sample buffer containing 1 mM DTT. Samples 
were heated at 100"C for 5 min and loaded onto a 10% Bis-Tris 
PAGE gel. Autoradiography was performed for 2-4 days. Assays 
were quantitated using a Molecular Dynamics Computing Densitom- 
eter and Image Quant software. 
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Saturation mutagenesis of the mature human inter- 
leukin-la (IL-lcr) gene has been performed. Following 
expression in Escherichia coli, the biological and 
receptor binding activities of the mutant proteins were 
examined. Most of the molecule could be altered with 
little effect on either function. More than 3,600 mu- 
tants were examined, and only 23 unique amino acid 
sequences were identified which resulted in an altered 
ratio of biological to binding activity when compared 
with wild-type ELr-la. These proteins possessed muta- 
tions at 38 of the 159 amino acid residues in IL-la, 
Random mutagenesis at several of these positions iden- 
tiHed further substitutions that affected activity. Ex- 
amination of a model for IL-la localized most of the 
residues which altered activity along one face of the 
molecule. This region appears to be distinct from areas 
of IL-1 which have been postulated to make contact 
with IL-1 receptor. 



Interleukin-l (IL-1)* is a potent cytokine that is involved 
in inflammatory responses and affects the growth and differ* 
entiation of T cells, B cells, and fibroblasts (for review, see 
Durum et aL, 1985). The two molecules responsible for this 
activity, IL-la and IL-1)3, share only 22% amino acid similar- 
ity (March etai, 1985; Auron et aL, 1987). Each binds to both 
forms of IL-1 receptor (Dower et oL, 1986; McMahan et oL, 
1991). Both IL-1 molecules are produced as intracellular 
precursors and are subsequently processed to mature proteins. 
Although the precursor form of IL-la is biologically active, 
only the mature form of IL-ip has any biological activity 
(Mosley et aL, 1987a, 1987b). cDNAs have been isolated 
encoding a third form of IL-1, IL-1 receptor antagonist (IL- 
Ira) (Carter et oL, 1990; Eisenberg et aL, 1990; Haimum et aL, 
1990). This molecule has homology to both IL-la and IL-1^ 
and has an affinity for the IL-1 receptors close to that seen 
for IL-lo and IL-1 A yet elicits no biologiccd response from 
target ceUs (Arend et aL, 1990; Carter et al., 1990; Eisenberg 

* This work was supported by funding from a joint venture between 
Immunex CoTporation and Eastman Kodak. The costs of publication 
of this article were defrayed in part by the payment of page charges. 
This article must therefore be hereby marked ^^advertisement" in 
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et aL, 1990; McMahan et aL, 1991). Alignments among these 
three sequences for several species do not indicate which 
residues are important for activity (Yanofsky and Zurawaki, 
1990). The three-dimensional structures of IL-la (Graves et 
cL, 1990) and IL-1/3 (Gilliland etaL, 1987; Priestle et aL, 1990) 
demonstrate the structural similarity of the two molecules but 
do not suggest which re^ons of the molecules are responsible 
for activity. 

Deletion and combinatoricd mutagenesis have identified 
residues at the amino terminus of IL-la which are needed for 
biological activity (Yanofsky and Ziurawski, 1990). However, 
since the affinities of these mutants for IL-1 receptors were 
not examined, it is impossible to differentiate whether these 
mutations affect biological activity, the ability to bind IL-1 
receptor, or the structural integrity of the protein. The exist- 
ence of IL-lra demonstrates the ability to separate biological 
activity from binding activity. Although many site-directed 
mutations of IL-la and IL-1^ have little effect on the function 
of the proteins (Gronenborn et ol, 1988; Kamogashira et oL^ 
1988a, 1988b; Craig et aL, 1989). several mutants demonstrate 
greatly reduced biological activity with little change in affinity 
for the type I IL-1 receptor (Gehrke et aL, 1990; Yamayoshi 
et aL, 1990). 

Using a novel method of saturation mutagenesis, random 
mutations were generated throughout the entire sequence of 
IL-la. Assays to determine both biological and binding activ- 
ity were performed on several thousand muttuit proteins. By 
examining the ratio of biological to binding activity for each 
mutant and comparing it with the ratio for wild-type IL-la, 
regions of IL-la which affect these two properties differen- 
tially were identified. 

MATERIALS AND METHODS 

Emymes and Vectors— All restriction enzymes, T4 DNA ligase, 
and T4 polynucleotide kinase were obtained from Boehringer Mann- 
heim or New England Biolabs. BBGl, a plaamid bearing a synthetic 
gene for human IL-la, was purchased from British Biotechnology. 
The construction of pPLBBGIL-la has been described previously 
(Poindexter et aL, 1991). An 5pel site was added using site-directed 
mutagenesis. Plasmid DNA was purified by the alkaline lysis method 
(Ausubel et aL, 1988). 

DNA Synf/xe^is— Oligonucleotide cassettes used for the construc- 
tion of mutants were synthesized on an Applied Biosystems model 
380A DNA synthesizer. For saturation mutagenesis each of the four 
phosphoramidites was contaminated with a small amount of the other 
three. The phosphoramidites were contaminated at two different 
levels. 4.2% for the sense strand and 8.4% for the antisense strand 
(Poindexter et aL, 1991). For random mutagenesis, the cassette was 
synthesized normally except for the substitution of an equimolar 
mixture of the four phosphoramidites for the three nucleotides mak- 
ing up the chosen codon. Oligonucleotides were purified by polyacryl- 
amide gel electrophoresis on a 40-cm 8% potyacrylamide, 7 M urea 
gel. Care was taken to excise full-length oligonucleotides, and the 
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Fig. 1. Coding 8e<iuence of human 
IL-la gene used in saturation mu- 
tagenesis. The DNA sequence down- 
stream from the promoter of 
pPLBBGIL-la is shown (accession no. 
X55445). The coding sequence is repre- 
sented by capital letters. The coding se- 
quence for the IL-lor gene is divided into 
eight regions, named for the restriction 
enzymes that border them. The nine re- 
striction enrymes are Clal (C), EcoRl 
(R), Pstl (P), Sstl iSl Pvull (U), BamHI 
(B), Spel (S), Bga {Gh and HindlU (H). 
Thus the eight regions are CR. RP, PS, 
SU» UB. BS. SG, and OH. 
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Fig. 2. Ability of biolo^cal and 
binding activity screens to identify 
mutants with altered activity. Panel 
A, 96 identical samples of wild-type IL- 
la. Panel B, 96 independent samples of 
wild-type IL-la. Panel C, 90 mutants of 
IL-la from the PS region {open circles) 
and 6 wild-type IL-lnr {closed circles). 
Panel D, 90 mutants of IL-lo from the 
RP region {open circles) and 6 wild-type 
IL-lof {closed circles). The biological ac- 
tivity is expressed in units/ml, and the 
binding activity is the reciprocal dilution 
that results in 50% binding inhibition 
(see ** Materials and Methods"). 



Binding Activity 



Binding Activity 



oligonucleotides were deprotected and resuspended in TE (10 mM 
Tris-HCl. pH 7.4. 1 mM EDTA). 

Assembly and Cloning of Mutagenic Oligonucleotides —The general 
procedure has been described (Poindexter et oL, 1991). TSventy pico- 
moles of each oligonucleotide in a mutagenic cassette were mixed in 
20 ill of TE and placed at 65 for 16 min. The mixture was allowed 
to cool slowly to room temperature and then placed on ice. Each 



mutagenic cassette had unique ends, allowing them to be ligated into 
appropriately cleaved vectors. To increase the efficiency of screening, 
intermediate vectors were constructed for each repon. These inter- 
mediate vectors contained an irrelevant segment of DNA inserted 
between the relevant restriction enzyme sites. Insertion of the mu- 
tagenic cassette followed by restriction with an enzyme unique to the 
intermediate plaamid greatly reduced the incidence of vectors without 



Identification of Regions in IL-la Important for Activity 



22107 



O) 

c 

C 

C9 



CD 
O 



IXIO'- 



(A) 



Mi 




ii 



Mutants 

(B) 



1 





Mutants 

Fig. 3. Mutants with altered activities. The activity ratios 
were normalized to the internal wild- type IL-la controls and the 
mean from at least three screens for mutants with normalized ratios 
10-fold g:reater or less than IL-la are shown. Panel A, proteins 
generated by saturation mutagenesis. Panel B, proteins generated by 
site-directed random mutagenesis at single amino acid residues. The 
single letter amino acid code is used to describe the amino acid in 
wild-type IL-la, the number of the residue, and the amino acid of the 
mutant. An asterisk indicates single amino acid changes that were 
also isolated from the saturation mutagenesis screens. 



a cassette inserted. The ligation mixture was transformed into 
GMl[pRK248]. The transformants were screened for insertion of the 
mutagenic cassette by colony hybridization (Poindexter et oL, 1991), 
Double -stranded sequencing of the vectors with inserts was performed 
using the dideoxy method (Sanger et o/., 1977). 

Protein Expression and Analysis — Mutant proteins were produced 
by using a pH induction protocol (Poindexter and Gayle, 1991). Cells 
containing recombinant plasmids were inoculated into 24-well plates 
containing I ml of Superbroth (Ausubel et oL, 1988), supplemented 
with M9 minimal salts and 1% glucose. Following overnight growth 
at 30 *C. the pH of the medium was shifted to 9 by the addition of 5 
M NaOH. The cells were grown at 30 'C at pH 9 for 18 h before the 
cells were pelleted. Oils (40 ft\) were spun down in a 96-well plate at 
3.000 rpm for IS min. The cells were resuspended in an equal volume 
of lysis buffer (125 mM Tria, pH 8, 2% SDS) and then 80 of 
phosphate-buffered saline (137 mM NaCl, 2.7 mM KCl, 4,3 mM 
Na,HPO4.7H20. 1.4 mM KHjHPO^, pH 7.4) was added. The samples 
were then used for biological or binding assays. Binding activity was 
determined by the capacity of Escherichia coli cell lysates to inhibit 
the binding of radiolabeled IL-la to EL4 cells (Mosley et oL, 1987b). 
Biological activity was examined using an EL4 conversion assay 
(Mosley et al, 1987a). E. coli lysates containing wild-type IL-la were 
included in every assay as controls. The data were analyzed by a 
nonlinear least sqiiares fitting routine , and activities were quantified 
using a standard curve, derived from purified IL-la, for each biolog- 
ical assay and each binding assay. 

Modeling IL-la Structure — The three-dimensional model of IL-la 
used in this study is an all atom protein model built using the Ca 
coordinates generated from the stereo diagrams of the crystal struc- 
ture (Graves ef ai, 1990). The X and Y coordinates for all of the Ca 
atoms were measured in the user unit using one of the two figures in 
the stereo diagram. The known standard distance of 3.8 A, expected 
between the successive Ca atoms, was used to scale and assign the Z 
coordinates for all of the C« atoms. An all atom model of 169 residues 



was constructed in FRODO (Jones, 1985) from these 151 Ca coordi- 
nates (Leu-7 to A8n-157). The structure was refmed in a modified 
version of Biograf (Peeples and Goldstein, 1989). 

Distances between the successive Ca atoms were computed in the 
X- Y projection by measuring the X and Y coordinates from one of 
the two stereo diagrams. A value of +1, —1, or 0 was assigned to the 
Z coordinates of every Ca atom if it was above, below, or in the same 
depth as the previous Ca atom in the stereo view. The scale factor 
required to change the user unit to A unit was computed by equating 
the computed projection distance between two successive Ca atoms, 
with a sign of 0, to that of the standard distance of 3.8 A. The X and 
Y coordinates of all of the Ca atoms were multiplied with this scale 
factor, and the projection distance between the successive Ca coor- 
dinates was computed. All of the projection distances measured this 
way will be either less than or equal to 3.8 A because of the Z 
flattening. Appropriate Z coordinates were then assigned to each Ca 
atom such that the computed distance between successive Ca meas- 
ured 3.8 A. The computed Z coordinates were multiplied with the 
sign assigned to them. The cumulative error in the computation of 
the Z coordinates was corrected by a<Jjusting only the Z coordinates 
using a wire model. The accuracy of the computed coordinate was 
tested by superimposing the resulting Ca trace on to the structurally 
conserved regions of IL-1^ structure. The root mean square value for 
the structurally conserved region is 0.75 A. 

RESULTS 

Alteration of the IL-la Gene by Saturation Mutagenesis — 
The synthetic IL-la gene has unique restriction enzyme sites 
approximately every 60 base pairs, dividing the gene into 
eight different regions (Fig. 1). The mutants from each region 
of the molecule are described by the restriction enzymes found 
at each end of the region. Thus the eight regions are CR 
{Clal'EcoRl), RP {EcoRl-Pstl), PS {PsthSstll SU (Sstl- 
Puullh UB (PuulUBamKl), BS (BomHI-Spel). SG {Spel- 
BgH), and GH (Bg[l-Hin6lll), Saturation mutagenesis using 
cassettes for each region was performed throughout the entire 
molecule, relying on a technique that results in very low levels 
of wild-type sequences and roughly equal probabilities of one 
to five nucleotide changes in any one region (Poindexter et 
a/., 1991). The use of intermediate plasmids in the construc- 
tions, along with colony hybridizations (see ''Materials and 
Methods"), greatly improved the yield of recombinant vectors, 
allowing insert frequencies over 90% to be achieved. More 
than 3,500 mutants were generated by this approach, encom- 
passing the eight regions of the IL-la gene (Fig. 1). 

In one region (RP), 30 mutants were sequenced to deter- 
mine accurately the mutation frequencies (Poindexter et al., 
1991). Those 30 mutants averaged 2.5 amino acid changes 
each, and every amino acid that could be altered in this region 
was found to be changed at least once. Several amino acids 
had three or four different substitutions. Sequencing mutants 
in each of the other seven regions did not reveal any deviation 
from the expected mutation frequency. More than 110 mu- 
tants were sequenced, and approximately 70% of the amino 
acid residues in IL-la were altered at least once. At this rate 
of mutagenesis, screening approximately 520 mutants, or 
roughly 65 mutants from each of the eight regions, should 
result in a 99% chance that all of the amino acids in IL-la 
which could be changed were altered at least once (for a 
discussion of calculating these probabilities, see Hutchison et 
oL, 1986). Thus, examination of 3,500 mutants should sample 
multiple mutations at every possible amino acid. 

Screening of Mutant IL-la Profeirw— Determining both the 
biological activity and the ability to inhibit the binding of 
wild-type IL-la to the type I IL-1 receptor for each mutant 
allowed molecules to be identified which affected these two 
functions differentially. The ratio of biological activity to 
binding ability was examined. This ratio represents an intrin- 
sic specific activity of the molecule and therefore should be 
independent of protein concentration. To validate this ap- 
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Fig. 4. Sequence of mutants that 
alter activity. The amino acid se- 
quence of the televant cassettes is shown 
with the amino acid changes for each 
mutant underneath. The single letter 
amino acid code is used. 
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proach, the biological activity cmd binding activity of 96 
identical samples of an E, coll lysate containing wild-type IL- 
la were determined (Fig. 2A). Plotting the biological activity 
against the binding activity resulted in a cluster of points, 
with the greatest error being in the biological activity, presum- 
ably because of the greater inherent variation in this assay 
than in the binding assay. Screening £. coli lysates from 96 
different inductions, each producing wild-type IL-la, resulted 
in the greater licattcr with a tendency for the points to cluster 
along a line whose slope equaled the activity ratio of wild- 
type IL-la (Fig, 2B), Altering the concentration of wild-type 
IL-la only moves the ratio along this line. Dilutions were 
performed to verify that the activity seen was linear with 
respect to IL-la concentration (data not shown). The assays 
were able to distinguish levels of biological and binding activ- 
ity over a range greater than 1,000-fold. 

Mutants with a wild-type activity ratio should fall along 
the same line as wild-type IL-la controls included in each 
assay. Proteins with increased biological activity in compari- 
son to the amount of binding activity seen should fall above 
this line, whereas mutant proteins with decreased amounts of 
biological activity compared with the binding activity seen 
should fall below this line. For screening purposes, mutants 
were determined to have altered activity ratios if there was at 
least a 10- fold increase or decrease in the ratio relative to 
wild-type IL-la for at least three independent sets of assays. 
In addition, mutants whose E. coli lysates lacked activity in 
either assay were rescreened to verify the lack of activity. 

Approximately 1,700 clones were examined from the CR, 
PS, SU, and UB regions, encompassing more than one-half 
of the molecule. None of the mutants that demonstrated 



activity displayed any significant deviation from wOd-type 
levels. These regions have an average of 2,2 amino acid 
changes per mutant (data not shown), which is not signifi- 
cantly different from the expected mutation rate. Fig. 2C 
displays a set of typical data for 90 mutants from the PS 
region of the molecule. The majority of these proteins have 
activity ratios similar to wild-type IL-la. The other three 
regions give similar profiles. 

Although most of the 1,800 mutants examined from the 
other four regions (RP, BS, SG, and GH) had activity ratios 
within 10-fold of wild-type IL-la, each region included several 
mutants that had a ratio that deviated from the wild*type IL- 
la ratio by more than 10-fold. An analysis of 90 typical 
mutants from the RP region is shown in Fig. 2D. There are 
several mutants in this group which have significantly differ- 
ent activity ratios from wild-type IL-la. Although several of 
these mutants failed to maintain an altered activity ratio 
upon subsequent assays, many continued to have altered 
activity ratios. 

Mutants from saturation mutagenesis with activity ratios 
10-foId higher or 10-fold lower than wild-type are shown in 
Fig. 3A, IL-la showed itself to be extremely resilient to 
change. Only 24 unique DNA sequences, out of more than 
3,500 examined, produced protein that displayed a significant 
difference in activity from wild-type (Fig. 3A ), This represents 
less than 0.7% of the mutants examined. 

Of the 24 different mutants with altered ratios, 23 had 
unique amino acid sequences (Fig. 4). RP754 and RP761, 
although having the same amino acid sequence, have different 
DNA sequences. This demonstrates the power of this ap- 
proach since it was possible to identify two independent clones 
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Fig. 6. Ali^ment of human and murine IL-la, human and murine IL-lff, and IL-1 receptor antagonist. Residues that are 
conserved in at least three of the sequences are boxed. The /J-strands for IL-la are shown above the sequence. The numbering for human IL- 
la and IL-ljS is displayed on the right. Closed circles indicate amino acid residues that were changed in IL-la mutants with altered phenotypes. 
Open circles indicate residues in human IL-lfi which have been postulated to interact with the type I IL-1 receptor. 



Using site-directed mutagenesis, single amino acid changes 
that decrease or increase the ratio of biological activity to 
binding ability were found at several positions in the molecule. 
At most of these locations only the substitution of specific 
amino acids had any impact on activity. Screening up to 100 
isolates revealed only a small number of specific amino acid 
substitutions at Leu-24, Asn-26, Asn-29. Gly-117, Gln-136, 
and Phe-152 which altered the activity ratio. Substitutions at 
Asp-26 and Asn-108 were exceptions. The presence of any of 
10 amino acids at Asp-26 altered the activity ratio. Changes 
at Asp-26 which reduce the amount of biological activity have 
been reported (Yamayoshi et oL, 1990). This may indicate 
that an Asp residue at position 26 is required for wild-type 
activity. Several hydrophilic substitutions at Asn-108 did not 
appear to affect biological activity (Zurawski, 1991). However, 
placing hydrophobic residues at this position did affect the 
ratio, with a relative increase in the biological activity. 

Few other reports of IL-la mutagenesis have distinguished 
between changes that affect biological activity and those that 
also alter the interactions of the protein with the receptor. 
Mutations at Asp- 151 differentially affect biological activity 
and binding ability (Yamayoshi et ai, 1990). Examination of 
a large group of IL-la mutants has previously revealed the 
importance of A8p-26 and Asp- 151, but only the effects on 
biological activity were studied (Kawashima et al., 1992). 
Random combinatorial mutagenesis has identified amino 
acids in which substitutions have no effect on biological 
activity (Yanofsky and Zurawski, 1990). The lack of muta- 



tions at certain residues was used to infer the importance of 
these residues for biological activity. There was, however, no 
demonstration of the ability of these mutants to interact with 
the IL-1 receptor. Since only biological activity was examined, 
it is difficult to determine whether the constraints on amino 
acid residues were functional in nature or whether there were 
structural constraints as well. 
. Sequence alignments of human and mouse IL-la and IL- 
1^ are shown in Fig. 6. These molecules bind to the type I IL- 
1 receptor and are active in the EL4 conversion assay. Inclu- 
sion of IL-lra, a molecule that also binds to IL-1 receptor but 
produces no biological activity, in the alignment does not 
immediately reveal any obvious region that is responsible for 
the imcoupling of biological activity and the ability to bind 
receptor. The 39 amino acids identified in this report are also 
shown. Although many of these amino acids are found toward 
the COOH terminus of the molecule (which is the most highly 
conserved region of the molecule), there does not appear to 
be much selection for conserved residues. In fact, regions in 
which few amino acids are conserved, such as in streuid 2 or 
strand 8, have several residues that appear to be important 
for activity. Although there are no data demonstrating that 
IL-la and IL-1/3 interact with the type I IL-1 receptor in the 
same fashion, residues of IL-1/3 which have been shown to be 
important for biological activity are found in positions ho- 
mologous to some of the residues identified for IL-la (Gehrke 
et oL, 1990; Ju et aL, 1991). 

Interestingly, residues of IL-1^ which have been shown to 
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influence binding to the type I IL-1 receptor or have been 
proposed to interact with the receptor (Clore et aL, 1991; 
Grenfell et oL, 1991; Labriola-Tompkins et oL, 1991; Veera- 
pandian et oL, 1992) are generally located outside the regions 
of IL-la shown in Fig. 5. Out of 45 amino acids postulated to 
be involved in binding of IL-IP only 9 overlap with residues 
identified in this study (Fig. 6). IL-la and IL-1^ either have 
different regions interacting with the receptor, or there is a 
large region in IL-la and IL-1^ which is required for biological 
activity but not for high affinity interactions with the recep- 
tor. 

A superfamily of molecules with protein folding similar to 
IL-1 has been proposed (Graves et ai, 1990; Murzin et ai, 
1992). This superfamily includes certain proteinase inhibitors 
and heparin-binding growth factors, such as fibroblast growth 
factor. Two forms of fibroblast growth factor have been shown 
to fold in a very similar fashion to IL-la and IL-1^ (Ago et 
aL, 1991; Eriksson et aL, 1991; Zhang et al, 1991; Zhu et al, 
1991), displaying 12 /3-8trands with a pseudo 3-fold symmetry. 
Several regions of the fibroblast growth factor molecule have 
been identified which are important for activity. These areas 
occupy spatially similar regions of the fibroblast growth factor 
three-dimensional structure as the amino acids of IL-la iden- 
tified by saturation mutagenesis. The modes of interaction 
between members of the IL-1 superfamily and their respective 
receptors may involve similar regions of the folded protein. 

The manner in which IL-1 generates a biological response 
is complicated. There are two different ligands, an IL-1 an- 
tagonist and two forms of IL-1 receptor (Dower et aL, 1990). 
There is evidence for multiple pathways of signal transduction 
(for review see Sims et al., 1993). The effect of IL-1 on a 
particular cell type may depend on the receptor found on the 
cell and which signaling pathway is being used. Analysis of 
the effect of these mutations on the different biological re- 
sponses may be helpful in further elucidation of IL-1 signal 
transduction. 
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